Releases · Unstructured-IO/unstructured · GitHub
Release timestamps as listed on the page, in order: 10 Dec 17:56; 24 Nov 14:55; 15 Nov 00:14; 07 Nov 01:05; 17 Sep 14:27; 26 Aug 13:25; 13 Aug 23:41; 28 Jul 19:02; 23 Jul 13:32; 18 Jul 17:31
Releases: Unstructured-IO/unstructured
0.18.22
afd9118
This commit was created on GitHub.com and signed with GitHub’s verified signature.
0.18.21
91a9888
Enhancement
- Update save_elements unit test to check crop box padding behavior
Features
Fixes
- Update unstructured-inference to 1.1.2 to address CVEs
0.18.20
7c4d0b9
Enhancement
- Improve the VoyageAI integration
- Add voyage-context-3 support
- Flag extracted elements as such in the metadata for downstream use
Features
Fixes
0.18.18
b01d35b
Fixes
- Prevent path traversal in email MSG attachment filenames. Fixed a security vulnerability (GHSA-gm8q-m8mv-jj5m) where malicious attachment filenames containing path traversal sequences could write files outside the intended directory. The fix normalizes both Unix and Windows path separators before sanitizing filenames, preventing cross-platform path traversal attacks in partition_msg functions.
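The idea behind this fix can be illustrated with a minimal sketch. It is not the library's actual code: the function name `sanitize_attachment_filename` and the `default` fallback are hypothetical, and the only behavior taken from the release note is that both Unix and Windows separators are normalized before the filename is reduced to a safe name.

```python
import posixpath


def sanitize_attachment_filename(filename: str, default: str = "unknown") -> str:
    """Hypothetical helper: reduce an untrusted attachment name to a bare filename."""
    # Treat "\" and "/" alike, so Windows-style traversal is caught on any host OS.
    normalized = filename.replace("\\", "/")
    # Keep only the last path component; directories and ".." segments are dropped.
    basename = posixpath.basename(normalized).replace("\x00", "")
    # Names that are empty or consist only of dot segments are not usable filenames.
    if basename in ("", ".", ".."):
        return default
    return basename
```

Normalizing first matters because `posixpath.basename` alone would leave a name such as `..\..\evil.dll` untouched on a Unix host.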
0.18.17
Enhancement
Features
Fixes
- Removed Clarifai dependency as it is no longer used
- Bumped dependencies via pip-compile to address the following CVEs:
- pypdf: GHSA-vr63-x8vc-m265
- pip: GHSA-4xh5-x5gv-qwph
- uv: GHSA-8qf3-x8v5-2pj8 GHSA-pqhf-p39g-3x64
0.18.16
Enhancement
- Speed up function _assign_hash_ids by 34% (codeflash)
Features
Fixes
- Bumped dependencies via pip-compile to address the following CVEs:
- authlib: GHSA-pq5p-34cr-23v9
- python-3.12/python-3.12-base: CVE-2025-8291, GHSA-49g5-f6qw-8mm7
- libcrypto3/libssl3: CVE-2025-9230, CVE-2025-9231, CVE-2025-9232, GHSA-76r2-c3cg-f5r9, GHSA-9mrx-mqmg-gwj9
0.18.15
2d44d73
What's Changed
- Setup Codeflash Github Actions to optimize all future code by @misrasaurabh1 in #4082
- fix: update deps to resolve cve by @qued in #4093
- ⚡️ Speed up function group_broken_paragraphs by 30% by @aseembits93 in #4088
- ⚡️ Speed up method ElementHtml._get_children_html by 234% by @aseembits93 in #4087
- Luke/sept16 CVE by @luke-kucing in #4094
New Contributors
- @aseembits93 made their first contribution in #4088
Full Changelog: 0.18.14...0.18.15
0.18.14
fed8942
Enhancements
- Speed up function sentence_count by 59% (codeflash)
- Speed up function check_for_nltk_package by 111% (codeflash)
- Speed up function under_non_alpha_ratio by 76% (codeflash)
Features
Fixes
- Change the short-text language detection log to debug level to reduce warning-level log spam
- Bumped dependencies via pip-compile to address the following CVEs:
- Python 3.12/3.13: CVE-2025-8194, GHSA-v594-44hm-2j7p
- glibc & related (glibc, glibc-locale-posix, ld-linux, libcrypt1): CVE-2025-8058, GHSA-8xjp-c72j-67q8
- aiohttp: GHSA-9548-qrrj-x5pj
- openjpeg: CVE-2025-54874
- pypdf: GHSA-7hfw-26vp-jp8m
- transformers: GHSA-9356-575x-2w9m
- urllib3: GHSA-48p4-8xcf-vxj5
0.18.13
0d20f6a
Fixes
- Parse a wider variety of date formats in email headers. The partition_email function is now more robust to non-standard date formats, including ISO-8601 dates with "Z" suffixes. This prevents ValueError exceptions when partitioning emails with these date formats.
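One way to get this kind of tolerance is sketched below. It is an assumption about the approach, not the code used by partition_email: the helper name `parse_email_date` is hypothetical, and it simply tries the standard RFC 2822 parser first and falls back to ISO-8601, mapping a trailing "Z" to an explicit UTC offset.

```python
from datetime import datetime
from email.utils import parsedate_to_datetime
from typing import Optional


def parse_email_date(value: str) -> Optional[datetime]:
    """Hypothetical helper: parse a Date header, returning None instead of raising."""
    value = value.strip()
    try:
        # Standard email format, e.g. "Wed, 23 Jul 2025 13:32:00 +0000".
        return parsedate_to_datetime(value)
    except (TypeError, ValueError):
        pass  # Not RFC 2822; older Python versions raise TypeError here.
    try:
        # datetime.fromisoformat() only accepts a bare "Z" on Python 3.11+,
        # so rewrite it as "+00:00" to stay portable.
        iso = value[:-1] + "+00:00" if value.endswith(("Z", "z")) else value
        return datetime.fromisoformat(iso)
    except ValueError:
        return None
```

Returning `None` rather than raising is a design choice of this sketch; a caller could equally log the failure and leave the date metadata unset.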
0.18.12
b8c14a7
What's Changed
- Prevent large file content in encoding exceptions. Replaced UnicodeDecodeError with UnprocessableEntityError in encoding detection to avoid storing the entire file content in exception objects, which can cause issues in logging and error-reporting systems when processing large files.
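The problem arises because a `UnicodeDecodeError` keeps the full input bytes in its `object` attribute. The sketch below shows one way to avoid carrying that payload around. The `UnprocessableEntityError` class defined here is a local stand-in for illustration, and `decode_or_raise` is a hypothetical helper, not the library's function.

```python
class UnprocessableEntityError(Exception):
    """Stand-in for illustration; the real class lives in the library."""


def decode_or_raise(data: bytes, encoding: str = "utf-8") -> str:
    """Hypothetical helper: decode bytes, raising a lightweight error on failure."""
    try:
        return data.decode(encoding)
    except UnicodeDecodeError as exc:
        # Copy out only small scalar details; exc.object holds the whole input.
        message = (
            f"could not decode {len(data)} bytes as {encoding}: "
            f"{exc.reason} at position {exc.start}"
        )
    # Raising outside the except block means the new exception has no
    # __context__ pointing back at the UnicodeDecodeError and its payload.
    raise UnprocessableEntityError(message)
```

Note that `raise ... from None` inside the `except` block would hide the original error from tracebacks but still keep it reachable through `__context__`, which is why this sketch raises after the block exits.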
Full Changelog: 0.18.11...0.18.12
0.18.11
591729c
What's Changed
- add '|' as a delimiter in csv files by @jiajun-unstructured in #4059
- feat: map tags by type + add coverage by @MaksOpp in #4068
- chore: switch to charset normalizer by @qued in #4060
- bump version and release by @MaksOpp in #4070
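For the '|' delimiter change listed above, delimiter detection of this kind can be sketched with the standard library's `csv.Sniffer`. Whether the library uses `Sniffer` is an assumption; the helper name `detect_delimiter`, the candidate set, and the comma fallback are all hypothetical choices made for this example.

```python
import csv


def detect_delimiter(sample: str, candidates: str = ",;\t|") -> str:
    """Hypothetical helper: guess the delimiter of a CSV sample."""
    try:
        # Restrict the sniffer to an explicit candidate set that includes "|".
        return csv.Sniffer().sniff(sample, delimiters=candidates).delimiter
    except csv.Error:
        # The sniffer could not decide; fall back to a plain comma.
        return ","
```

The detected character can then be passed on to a reader, for example `csv.reader(lines, delimiter=detect_delimiter(sample))`.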
Full Changelog: 0.18.10...0.18.11
0.18.10
a040483
Enhancements
Features
- Add OCR_AGENT_CACHE_SIZE environment variable. Added a configurable cache size for OCR agents to control memory usage.