The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such...
2 KB (98 words) - 17:19, 25 June 2025
splitting text into sentences - metacpan.org". metacpan.org. "Apache OpenNLP". opennlp.apache.org. "Welcome | FreeLing Home Page". "NLTK :: Natural Language...
6 KB (567 words) - 21:26, 13 September 2024
Retrieved 2024-06-07. "Welcome to Apache Lucene". lucene.apache.org. Retrieved 2024-06-07. "Apache OpenNLP". opennlp.apache.org. Retrieved 2024-06-07. "Alicebot...
40 KB (3,553 words) - 05:49, 26 July 2025
processing OpenCV — library of programming functions mainly for real-time computer vision Tesseract – optical character recognition Apache OpenNLP Apertium...
11 KB (812 words) - 23:52, 6 August 2025
Apache cTAKES: clinical Text Analysis and Knowledge Extraction System is an open-source Natural Language Processing (NLP) system that extracts clinical...
10 KB (882 words) - 03:52, 15 July 2025
Data management system framework Apache Oozie Server-based workflow scheduling system to manage Hadoop jobs. Apache OpenNLP Java machine learning toolkit...
17 KB (12 words) - 20:19, 10 December 2024
This list of Apache Software Foundation projects contains the software development projects of The Apache Software Foundation (ASF). Besides the projects...
38 KB (4,300 words) - 16:50, 29 May 2025
extraction Text mining (also referred to as text data mining) Truecasing Apache OpenNLP spaCy General Architecture for Text Engineering Natural Language Toolkit...
4 KB (462 words) - 20:55, 17 July 2025
al. 2014. Apache OpenNLP includes char n-gram based statistical detector and comes with a model that can distinguish 103 languages Apache Tika contains...
8 KB (917 words) - 21:46, 27 July 2025
Principle-Based Parsing" (PDF). www.vinartus.net. pp. 257–278. Apache OpenNLP OpenNLP includes a chunker. GATE General Architecture for Text Engineering...
3 KB (289 words) - 11:36, 25 June 2025
free Information Extraction system Apache OpenNLP is a Java machine learning toolkit for natural language processing OpenCalais is an automated information...
21 KB (2,541 words) - 00:01, 23 April 2025
Free and open-source software portal Comparison of cryptography libraries Graphics library Harbour libraries and tools List of .NET libraries and frameworks...
31 KB (125 words) - 20:12, 27 June 2025
Spark NLP for optical character recognition (OCR) from images, scanned PDF documents, and DICOM files. It is a software library built on top of Apache Spark...
11 KB (989 words) - 06:10, 14 July 2025
Word Variants, ACM Transactions on Information Systems, 16(1), 61–81 Apache OpenNLP—includes Porter and Snowball stemmers SMILE Stemmer—free online service...
31 KB (3,901 words) - 19:08, 19 November 2024
current, unrelated to patient), and negated/not negated. Also known as Apache cTAKES. DMAP – ETAP-3 – proprietary linguistic processing system focusing...
70 KB (7,763 words) - 00:00, 15 July 2025
list of open-source programming languages and the open-source license it is released under. Free and open-source software portal Free and open-source software...
7 KB (101 words) - 10:43, 27 July 2025
open source project in June 2019 under the Apache 2.0 license BERT - Google LLM released as an open source project in October 2018 under the Apache 2...
79 KB (5,811 words) - 22:59, 5 August 2025
Apache Stanbol is an open source modular software stack and reusable set of components for semantic content management. Apache Stanbol components are meant...
14 KB (1,319 words) - 06:03, 17 January 2025
and open-source software (FOSS) licenses, such as the Apache License, MIT License, and GNU General Public License, outline the terms under which open-source...
75 KB (8,051 words) - 05:25, 25 July 2025
Fast.ai (section Massive Open Online Course)
the first to announce its support. This open-source framework is hosted on GitHub and is licensed under the Apache License, Version 2.0. "Launching fast...
4 KB (409 words) - 19:34, 31 July 2025
Heinrich (2020). "Retrieval-augmented generation for knowledge-intensive NLP tasks". Advances in Neural Information Processing Systems 33: 9459–9474....
24 KB (1,700 words) - 01:34, 8 August 2025
provide a complete Web application framework. OSF is made available under the Apache 2 license. OSF is a platform-independent Web services framework for accessing...
15 KB (1,569 words) - 00:17, 8 July 2025
GPT-J (category Open-source artificial intelligence)
GPT-J and fine-tuned variants. In March 2023, Databricks released Dolly, an Apache-licensed, instruction-following model created by fine-tuning GPT-J on the...
11 KB (1,015 words) - 12:21, 2 February 2025
Deeplearning4j (category Software using the Apache license)
parallel versions that integrate with Apache Hadoop and Spark. Deeplearning4j is open-source software released under Apache License 2.0, developed mainly by...
17 KB (1,378 words) - 02:36, 11 February 2025
5.5 Liberty IBM HTTP Server IBM SDK Apache ZooKeeper Redis Elasticsearch Apache NiFi Apache Kafka Reddison CoreNLP "HCL Commerce 9.1.17". "TradeCentric...
10 KB (995 words) - 05:26, 19 April 2025
libraries and programs for symbolic and statistical natural language processing (NLP) for English written in the Python programming language. It supports classification...
5 KB (336 words) - 18:39, 26 June 2025
reduction Novelty detection Nuisance variable One-class classification Onnx OpenNLP Optimal discriminant analysis Oracle Data Mining Orange (software) Ordination...
39 KB (3,385 words) - 07:36, 7 July 2025
Software Foundation announced its decision to use the Apache License 2.0 for providing the software as open-source. The foundation will make the contributions...
11 KB (959 words) - 12:37, 5 August 2025
GPT-3 (category OpenAI)
consisting of 410 billion byte-pair-encoded tokens. Fuzzy deduplication used Apache Spark's MinHashLSH. Other sources are 19 billion tokens from WebText2 representing...
55 KB (4,892 words) - 14:57, 8 August 2025
permissive Apache License. In January 2025, DeepSeek released DeepSeek R1, a 671-billion-parameter open-weight model that performs comparably to OpenAI o1 but...
125 KB (13,357 words) - 09:43, 8 August 2025