Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
The Top 10 Tika Open Source Projects
Open source projects categorized as Tika
Categories
>
Data Processing
>
Tika
Edit Category
laurilehmijoki/s3_website
⭐
2,259
Manage an S3 website: sync, deliver via CloudFront, benefit from advanced S3 website features.
dependent packages
0
total releases
0
most recent commit
about 3 years ago
apache/tika
⭐
2,007
The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).
dependent packages
0
total releases
0
most recent commit
about 2 years ago
chrismattmann/tika-python
⭐
1,316
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
dependent packages
0
total releases
0
most recent commit
over 2 years ago
dadoonet/fscrawler
⭐
1,279
Elasticsearch File System Crawler (FS Crawler)
dependent packages
0
total releases
0
most recent commit
about 2 years ago
pemistahl/lingua
⭐
622
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
dependent packages
0
total releases
0
most recent commit
over 2 years ago
ICIJ/datashare
⭐
519
A self-hosted search engine for documents.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
USCDataScience/sparkler
⭐
401
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
dependent packages
0
total releases
0
most recent commit
about 3 years ago
pcbje/gransk
⭐
237
Document processing for investigations
dependent packages
0
total releases
0
most recent commit
over 9 years ago
ICIJ/extract
⭐
229
A cross-platform command line tool for parallelised content extraction and analysis.
dependent packages
0
total releases
0
most recent commit
about 2 years ago
michaelklishin/pantomime
⭐
171
A tiny Clojure library that deals with MIME types (Internet media types)
dependent packages
0
total releases
0
most recent commit
about 7 years ago
Get A Weekly Email With Trending Tika Projects
No Spam. Unsubscribe easily at any time.
Tika
Subscribe
Javascript must be enabled to subscribe.
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2026 Awesome Open Source. All rights reserved.