Text Generation
Safetensors
English
Chinese
qwen3
commoncrawl
html-extraction
content-extraction
information-extraction
qwen
conversational
Instructions to use opendatalab/MinerU-HTML with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Inference
- Xet hash:
- 13503d4d1f585fe6b50c9760275c70093b815f9c0ee65675adfa11cd771bc9d5
- Size of remote file:
- 1.5 GB
- SHA256:
- 32e21cff0eb06c724b26d00090f6199c90ceb2717438c8a83544eb5fb7878acd
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.