opendatalab/MinerU

Python

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

49 / 100

Activity

Stable

Score Breakdown

Issue Response
25 / 25 pts
PR Merge Rate
18 / 20 pts
External Contributions
0 / 15 pts
Commit Distribution
1 / 15 pts
Good First Issues
0 / 10 pts
Complexity
5 / 10 pts
CONTRIBUTING
0 / 4 pts
Code of Conduct
0 / 1 pts
Repo is code
0 (No Penalty)
Repo is active
0 (No Penalty)
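
The category points listed above add up to the overall 49 / 100. The page does not state how the total is computed, so the following is only a minimal sketch that assumes a plain sum of category points minus penalties; the names `breakdown`, `penalties`, `total_score`, and `max_score` are hypothetical and used for illustration only.

```python
# (points earned, points available) per category, copied from the breakdown above.
breakdown = {
    "Issue Response": (25, 25),
    "PR Merge Rate": (18, 20),
    "External Contributions": (0, 15),
    "Commit Distribution": (1, 15),
    "Good First Issues": (0, 10),
    "Complexity": (5, 10),
    "CONTRIBUTING": (0, 4),
    "Code of Conduct": (0, 1),
}

# Penalties shown on the page; both are 0 ("No Penalty").
penalties = {"Repo is code": 0, "Repo is active": 0}


def total_score(breakdown, penalties):
    """Assumed formula: sum of earned points minus the sum of penalties."""
    earned = sum(points for points, _ in breakdown.values())
    return earned - sum(penalties.values())


def max_score(breakdown):
    """Sum of the points available across all categories."""
    return sum(available for _, available in breakdown.values())


print(f"{total_score(breakdown, penalties)} / {max_score(breakdown)}")
```

Under that assumption the sketch reproduces the displayed total, with External Contributions, Commit Distribution, and Good First Issues accounting for most of the missing points.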

Repository Stats

Stars
51.9k
Forks
4.3k
Open Issues
164
Lines of Code
45.8k
Contributors
77

Contributor Friendliness

Good First Issues
0
Help Wanted
0
Avg Issue Response
4m
Avg PR Merge Time
4h
Avg PR Review Time
3h
Issue Close Rate
64%

Project Info

Created
Feb 29, 2024
Age
1 year old
Last Push
2d ago
Size
145.0 MB

Contributors also worked on

Ebazhanov/linkedin-skill-assessments-quizzes

Python

Full reference of LinkedIn answers 2024 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point), LinkedIn Excel test solutions, LinkedIn machine learning test, LinkedIn test questions and answers

28.7k 1.6k
58

nvim-lua/kickstart.nvim

Lua

A launch point for your personal nvim configuration

28.8k 163
40

songquanpeng/one-api

JavaScript

LLM API management and distribution system supporting mainstream models including OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It unifies multiple providers under a single API and can be used for key management and redistribution. Single executable, Docker image provided, one-click deployment, ready to use out of the box, with an English UI.

28.7k 135
35

JaidedAI/EasyOCR

Python

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

28.6k 112
32

eugeneyan/applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28.6k 47
28

kenwheeler/slick

JavaScript

the last carousel you'll ever need

28.6k 165
23

ZuzooVn/machine-learning-for-software-engineers

A complete daily plan for studying to become a machine learning engineer.

28.7k 38
17

abhisheknaiidu/awesome-github-profile-readme

😎 A curated list of awesome GitHub profile READMEs, updated in real time

28.7k 180
16

donnemartin/data-science-ipython-notebooks

Python

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

28.8k 12
6

dnSpy/dnSpy

C#

.NET debugger and assembly editor

28.7k 67
6