Document classification or document categorization is a problem in library science, information science and computer science. The task is to assign a...
13 KB (1,390 words) - 18:09, 7 July 2025
over a logical knowledge database. A document retrieval system consists of a database of documents, a classification algorithm to build a full text index...
6 KB (723 words) - 00:25, 3 December 2023
Superintendent of Documents Classification, commonly called as SuDocs or SuDoc, is a system of library classification developed and maintained by the...
12 KB (1,235 words) - 17:55, 26 May 2025
Naive Bayes classifier (redirect from Naive Bayesian classification)
event model typically used for document classification, with events representing the occurrence of a word in a single document (see bag of words assumption)...
50 KB (7,375 words) - 15:22, 22 July 2025
Classified information (redirect from Classified document)
is technically not a classification level. Though this is a feature of some classification schemes, used for government documents that do not merit a particular...
81 KB (6,764 words) - 07:45, 19 July 2025
multiplicity. The bag-of-words model is commonly used in methods of document classification where, for example, the (frequency of) occurrence of each word...
8 KB (926 words) - 02:02, 12 May 2025
IEC 61355 (section Classification code)
61355-1 Classification and designation of documents for plants, systems and equipment describes rules and guidelines for the uniform classification and identification...
15 KB (344 words) - 14:38, 16 April 2025
Linear classifier (redirect from Linear classification)
features. Such classifiers work well for practical problems such as document classification, and more generally for problems with many variables (features)...
9 KB (1,146 words) - 02:44, 21 October 2024
standardized document classification and automated information extraction Cui, Lei; Xu, Yiheng; Lv, Tengchao; Wei, Furu (2021). "Document AI: Benchmarks...
5 KB (579 words) - 23:15, 24 May 2025
Categorization Classification (general theory) Decimal classification Document classification Information retrieval Knowledge organization Library management...
20 KB (2,307 words) - 15:30, 7 July 2025
a query classification algorithm. However, the computation of query classification is non-trivial. Different from the document classification tasks, queries...
8 KB (1,099 words) - 22:29, 3 January 2025
Redaction (redirect from Document sanitizing)
information, redaction attempts to reduce the document's classification level, possibly yielding an unclassified document. When the intent is privacy protection...
13 KB (1,436 words) - 00:03, 7 July 2025
paper documents or importing electronic documents, often for the purposes of feeding advanced document classification and data collection processes. Most...
6 KB (765 words) - 15:13, 21 July 2024
includes self-supervised learning, generative adversarial networks, document classification and translation, and computer vision. FAIR released Torch deep-learning...
17 KB (1,318 words) - 07:09, 22 July 2025
displaying short descriptions of redirect targets Document classification – Process of categorizing documents Drug discovery and development – Process of bringing...
13 KB (1,899 words) - 17:53, 15 July 2024
Potthast, Martin (2023). "Trigger Warning Assignment as a Multi-Label Document Classification Problem". Proceedings of the 61st Annual Meeting of the Association...
10 KB (1,052 words) - 17:02, 22 July 2025
Taxonomy (redirect from Scientific classification)
classification of things or concepts, as well as to the principles underlying such work. Thus a taxonomy can be used to organize species, documents,...
56 KB (6,859 words) - 16:33, 28 June 2025
as a document classification ontology, or simply as a way to describe any kind of document in RDF. It has been inspired by many existing document description...
2 KB (153 words) - 09:57, 9 June 2025
the document using a scanner and the phase of interpreting the document, for example using natural language processing (NLP) or image classification technologies...
15 KB (1,538 words) - 21:09, 23 June 2025
A document management system (DMS) is usually a computerized system used to store, share, track and manage files or documents. Some systems include history...
28 KB (1,550 words) - 20:37, 29 May 2025
The Universal Decimal Classification (UDC) is a bibliographic and library classification representing the systematic arrangement of all branches of human...
60 KB (2,818 words) - 19:14, 18 July 2025
See also: Baca & Harpring (2000) and Shatford (1986). Aboutness Document classification Subject indexing Subject access Subject term Topic-comment Saracevic...
18 KB (2,656 words) - 15:01, 24 May 2025
field of information retrieval for measuring search, document classification, and query classification performance. It is particularly relevant in applications...
16 KB (2,486 words) - 20:57, 19 June 2025
Taxonomy (biology) (redirect from Systems of zoological classification)
and classification The science of classification, in biology the arrangement of organisms into a classification "The science of classification as applied...
69 KB (6,956 words) - 07:28, 19 July 2025
revolutionized NLP tasks like sentiment analysis, machine translation, and document classification. Computer vision: Image and video embeddings enable tasks like...
10 KB (1,191 words) - 02:22, 27 June 2025
Classified information in the United States (redirect from United States government classification system)
confidential. The U.S. no longer has a Restricted classification, but many other countries and NATO documents do. The U.S. treats Restricted information it...
94 KB (10,052 words) - 18:00, 13 July 2025
In machine learning, one-class classification (OCC), also known as unary classification or class-modelling, tries to identify objects of a specific class...
17 KB (2,323 words) - 12:01, 25 April 2025
Latent semantic analysis (section Term-document matrix)
used to: Compare the documents in the low-dimensional space (data clustering, document classification). Find similar documents across languages, after...
58 KB (7,629 words) - 00:39, 14 July 2025
evolution of language modelling. Consider a simple problem of document classification, where we want to assign a label (e.g., "spam", "not spam", "politics"...
7 KB (893 words) - 06:39, 24 June 2025
Digital mailroom (section Document classification)
processes. Using document scanning and document capture technologies, companies can digitise incoming mail and automate the classification and distribution...
13 KB (1,818 words) - 06:20, 12 July 2025