• Thumbnail for Apache Tika
    Apache Tika is a content detection and analysis framework, written in Java, stewarded at the Apache Software Foundation. It detects and extracts metadata...
    6 KB (503 words) - 09:30, 1 August 2024
  • village in Võru County Apache Tika, content analysis software Tika Waylan, a character in the DragonLance series of fantasy novels Tika and The Dissidents...
    2 KB (242 words) - 09:47, 15 September 2022
  • Thumbnail for Chris Mattmann
    studying with Dr. Nenad Medvidović and he went on to invent Apache Tika with Jérôme Charron. Apache Tika is a widely used software framework for content detection...
    8 KB (679 words) - 17:43, 17 June 2024
  • Thumbnail for Apache Nutch
    Apache Nutch is a highly extensible and scalable open source web crawler software project. Nutch is coded entirely in the Java programming language, but...
    13 KB (625 words) - 20:19, 5 January 2025
  • such as Lucene.NET, Mahout, Tika and Nutch. These three are now independent top-level projects. In March 2010, the Apache Solr search server joined as...
    15 KB (1,258 words) - 20:57, 20 June 2025
  • Thumbnail for Panama Papers
    Journalists indexed the documents using open software packages Apache Solr and Apache Tika, and accessed them by means of a custom interface built on top...
    160 KB (14,563 words) - 20:20, 19 June 2025
  • This list of Apache Software Foundation projects contains the software development projects of The Apache Software Foundation (ASF). Besides the projects...
    38 KB (4,300 words) - 16:50, 29 May 2025
  • design paradigm Apache Tapestry Component-oriented Java web application framework Apache Tika Content detection and analysis framework. Apache Tomcat Tomcat...
    17 KB (12 words) - 20:19, 10 December 2024
  • these services. A file Crawler automatically extracts metadata and uses Apache Tika to identify file types and ingest the associated information into the...
    9 KB (960 words) - 19:19, 12 November 2023
  • International Consortium of Investigative Journalists used Blacklight with Apache Tika to comb through the 11.5 million documents from Mossack Fonseca popularly...
    5 KB (427 words) - 08:54, 30 May 2023
  • al. 2014. Apache OpenNLP includes char n-gram based statistical detector and comes with a model that can distinguish 103 languages Apache Tika contains...
    8 KB (917 words) - 19:12, 23 June 2024
  • Thumbnail for USC Viterbi School of Engineering
    the second CEO of Apple Computer, Inc. Chris Mattmann, co-creator of Apache Tika. Mohamed Morsi, Egyptian politician and engineer who served as the fifth...
    22 KB (2,501 words) - 19:49, 27 May 2025
  • StormCrawler (category Software using the Apache license)
    for instance spout and bolts for Elasticsearch and Apache Solr or a ParserBolt which uses Apache Tika to parse various document formats. The project is...
    5 KB (405 words) - 09:53, 5 January 2025
  • Thumbnail for List of Web archiving initiatives
    ReplayWeb.page 1 Ghost Archive Common Crawl United States 2008 Apache Nutch, Apache Tika, pywb, in-house tools 3 3 GFNDC United States (global nodes in...
    118 KB (2,238 words) - 21:51, 14 June 2025
  • list (link) The full list of supported formats is available at: https://tika.apache.org/1.17/formats.html "Tworzenie korpusu — Korpusomat EU 0.1 - dokumentacja"...
    3 KB (297 words) - 13:23, 3 June 2025
  • Thumbnail for Meredith Stiehm
    Say How? A Pronunciation Guide to Names of Public Figures". corpora.tika.apache.org. Retrieved July 26, 2023. Littwin, Susan (November 2004). "In the...
    8 KB (742 words) - 03:16, 13 March 2025
  • Rosamund Pike Southside with You Barack Obama Parker Sawyers Michelle Robinson Tika Sumpter Churchill[citation needed] Winston Churchill Brian Cox Love Under...
    295 KB (197 words) - 02:16, 24 June 2025
  • Thumbnail for North American P-51 Mustang
    P-51 Mustang P-51D nicknamed "Tika IV" of 361st Fighter Group with underwing drop tanks General information Type Fighter National origin United States...
    136 KB (16,491 words) - 17:27, 24 June 2025
  • Thumbnail for Acetobacter aceti
    Program Under Toxic Substances Control Act (TSCA) | US EPA". corpora.tika.apache.org. Retrieved 2024-04-17. Type strain of Acetobacter aceti at BacDive...
    18 KB (2,095 words) - 16:09, 23 June 2025
  • Crafts Board: Southern Plains Indian Museum, Anadarko, Oklahoma". corpora.tika.apache.org. Retrieved 2022-09-11. Bucklew, Joan (1967-06-11). "Wide Range of...
    6 KB (520 words) - 04:20, 15 January 2025
  • Palestine Studies. Retrieved October 24, 2024. "PAOLO BIAGI". corpora.tika.apache.org. Retrieved October 24, 2024. Hamilakis, Yannis; Rojas, Felipe. "Hamilakis...
    91 KB (10,055 words) - 15:49, 15 June 2025
  • singer and rapper Tiitof (born 1995), French rapper and trap music artist Tika (born 1980), Indonesian singer and songwriter Tim (born 1981), Korean-American...
    284 KB (30,608 words) - 18:47, 20 June 2025
  • C. Brown to Peter P. Pitchlynn. Re: rumors of a band of Comanches and Apaches of hostile nature gathering. "Peter P. Pitchlynn Collection" Archived 17...
    582 KB (5,257 words) - 17:21, 24 June 2025
  • Thumbnail for History of Colorado
    of peoples that lived in the valleys and mesas of the Colorado Plateau Apache Nation — An Athabaskan-speaking nation that lived in the Great Plains in...
    51 KB (5,985 words) - 06:13, 22 June 2025
  •  2016 (2016-08-23) Rami Malek, Tika Sumpter & Parker Sawyers Diana Gordon The Late Show Thing-O-Meter. Rami Malek discusses Mr. Robot. Tika Sumpter and Parker Sawyers...
    103 KB (112 words) - 02:00, 29 April 2025