-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Issues: DS4SD/docling
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Headers not detected in old docx files
bug
Something isn't working
#1032
opened Feb 21, 2025 by
DuritaKJ
Token indices sequence length is longer than the specified maximum sequence length for this model (530 > 512). Running this sequence through the model will result in indexing errors
bug
Something isn't working
chunker
#1026
opened Feb 20, 2025 by
nicolofranceschi
Export to markdown only contains H2 headers
enhancement
New feature or request
#1023
opened Feb 19, 2025 by
nikhildigde
Avoid testing exact JSON output
tests
issue related to changes needed in the tests
#1022
opened Feb 19, 2025 by
dolfim-ibm
6 of 7 tasks
I don't understand why after removing the <a> and <span> tags the names and emails are not present in the markdown convertion
bug
Something isn't working
html
issue related to html backend
#1019
opened Feb 19, 2025 by
kupelabs
Automatically detect PDFs requiring force OCR
enhancement
New feature or request
layout
#1014
opened Feb 18, 2025 by
Fogapod
Seems like EasyOCR is not using GPU
question
Further information is requested
#1013
opened Feb 18, 2025 by
nikhildigde
Add pydantic base type support for page and table metadata
enhancement
New feature or request
#1005
opened Feb 18, 2025 by
ScottHMcKean
Add more information to <a> and <img> tags in HTML Backend
#1002
opened Feb 17, 2025 by
alex-james-bit
Do we have features like do_picture_description for word,pptx and other formats using VLM's as PdfpipelineOptions would be specific to PDF formats correct.
question
Further information is requested
#1001
opened Feb 17, 2025 by
arya18mak
IndexError while processing a PDF file
bug
Something isn't working
pdf parsing
PDF issue related to docling-parse
#1000
opened Feb 17, 2025 by
tomasamenezes
docling dependancies are conflicting the wdu dependancies
enhancement
New feature or request
#996
opened Feb 17, 2025 by
oprince
Latest version 2.20 Read pdf is very slow
question
Further information is requested
#995
opened Feb 17, 2025 by
langzichai
HybridChunker
not available with just docling
as dependency
bug
#994
opened Feb 17, 2025 by
sanmai-NL
Picture Description in Output
question
Further information is requested
#993
opened Feb 16, 2025 by
rhlarora84
pdf conversion output all numbers
bug
Something isn't working
pdf parsing
PDF issue related to docling-parse
#968
opened Feb 14, 2025 by
warm-july
Apply export_to_markdown to individual document items
enhancement
New feature or request
#962
opened Feb 14, 2025 by
simonschoe
Docling Produces Unreadable Text Output for PDFs
bug
Something isn't working
pdf parsing
PDF issue related to docling-parse
#960
opened Feb 13, 2025 by
josk0
Decrease CI network failures
bug
Something isn't working
enhancement
New feature or request
#959
opened Feb 13, 2025 by
dolfim-ibm
[ Feat. ] Please integrate radio-selected / radio-unselected similar to checkboxes.
enhancement
New feature or request
#956
opened Feb 13, 2025 by
DeezNutz6942O
Document export as markdown missing out some texts
bug
Something isn't working
#953
opened Feb 13, 2025 by
penquin17
docling_parse_v2 split/connect words
bug
Something isn't working
pdf parsing
PDF issue related to docling-parse
#952
opened Feb 12, 2025 by
InbarShapira
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.