rnewson's Profile - GitHub
- Unofficial Git mirror of CouchDB svn repository. (Updated every 10 mins) Forked from halorgium/couchdb Fri Feb 13 19:16:16 -0800 2009
- Enables full-text searching of CouchDB documents using Lucene. Created Sun Jan 25 07:37:12 -0800 2009
Public Activity
"couchdb-lucene.jar" is at rnewson/couchdb-lucene/downloads
439943f96e07668fe88895b0465a719b6e559836
handle document ID's with spaces in them (ensure they're indexed as a single token).
48192b1b1757601f6526bb623b9b93e437c2adc6
encode document ID as well, it might contain spaces, etc.
4ddea0a601f5e758a099e863c7b48ea0372f1239
perform URL-escaping for attachment names, add some debugging.
6ca08c31146ef99924de4c0b8a87c587cc0dd66b
update TODO
rnewson deleted branch tika at rnewson/couchdb-lucene
Wed Feb 18 14:33:46 -0800 2009
Deleted branch was at rnewson/couchdb-lucene/tree/tika
ec94e218d5fc84eaa9959c2065097f56cf03a702
updated README.md
4a60080428527a134f77b7f62365f7245d60d80b
use couchdb's content_type rather than auto-detect.
2a4e7671f7e98daa8a53b4ff7c24b0d7b17d9794
use Apache Tika to extract content of Word/PDF/XLS, etc. *very* alpha.
New branch is at rnewson/couchdb-lucene/tree/tika
118d28ebed78c22fd1735eeb0668b152b9424f29
JSON example output.