CARVIEW

MOTORHOMES

Select Language

HTTP/2 302 server: nginx date: Fri, 22 Aug 2025 04:16:09 GMT content-type: text/plain; charset=utf-8 content-length: 0 x-archive-redirect-reason: found capture at 20100106104904 location: https://web.archive.org/web/20100106104904/https://github.com/developmentseed/extractor server-timing: captures_list;dur=0.874773, exclusion.robots;dur=0.028118, exclusion.robots.policy;dur=0.012524, esindex;dur=0.014251, cdx.remote;dur=91.199778, LoadShardBlock;dur=163.430896, PetaboxLoader3.datanode;dur=72.415831, PetaboxLoader3.resolve;dur=65.126965 x-app-server: wwwb-app220 x-ts: 302 x-tr: 305 server-timing: TR;dur=0,Tw;dur=0,Tc;dur=0 set-cookie: wb-p-SERVER=wwwb-app220; path=/ x-location: All x-rl: 0 x-na: 0 x-page-cache: MISS server-timing: MISS x-nid: DigitalOcean referrer-policy: no-referrer-when-downgrade permissions-policy: interest-cohort=() HTTP/2 200 server: nginx date: Fri, 22 Aug 2025 04:16:10 GMT content-type: text/html; charset=utf-8 x-archive-orig-server: nginx/0.7.61 x-archive-orig-date: Wed, 06 Jan 2010 10:49:04 GMT x-archive-orig-connection: close x-archive-orig-status: 200 OK x-archive-orig-etag: "32e16c7337ad040eaa048bd4ae843aab" x-archive-orig-x-runtime: 1059ms x-archive-orig-content-length: 22063 x-archive-orig-cache-control: private, max-age=0, must-revalidate x-archive-guessed-content-type: text/html x-archive-guessed-charset: utf-8 memento-datetime: Wed, 06 Jan 2010 10:49:04 GMT link: ; rel="original", ; rel="timemap"; type="application/link-format", ; rel="timegate", ; rel="first memento"; datetime="Fri, 23 Oct 2009 10:37:07 GMT", ; rel="prev memento"; datetime="Tue, 01 Dec 2009 12:48:40 GMT", ; rel="memento"; datetime="Wed, 06 Jan 2010 10:49:04 GMT", ; rel="next memento"; datetime="Fri, 12 Feb 2010 11:03:40 GMT", ; rel="last memento"; datetime="Wed, 29 Feb 2012 08:45:04 GMT" content-security-policy: default-src 'self' 'unsafe-eval' 'unsafe-inline' data: blob: archive.org web.archive.org web-static.archive.org wayback-api.archive.org athena.archive.org analytics.archive.org pragma.archivelab.org wwwb-events.archive.org x-archive-src: 51_13_20100106065057_crawl102-c/51_13_20100106104744_crawl101.arc.gz server-timing: captures_list;dur=0.526426, exclusion.robots;dur=0.019033, exclusion.robots.policy;dur=0.009364, esindex;dur=0.010548, cdx.remote;dur=16.768182, LoadShardBlock;dur=150.997110, PetaboxLoader3.datanode;dur=150.451293, PetaboxLoader3.resolve;dur=102.217546, load_resource;dur=110.377051 x-app-server: wwwb-app220 x-ts: 200 x-tr: 327 server-timing: TR;dur=0,Tw;dur=0,Tc;dur=0 x-location: All x-rl: 0 x-na: 0 x-page-cache: MISS server-timing: MISS x-nid: DigitalOcean referrer-policy: no-referrer-when-downgrade permissions-policy: interest-cohort=() content-encoding: gzip developmentseed's extractor at master - GitHub

developmentseed / extractor

Source
Commits
Network (1)
Issues (1)
Graphs
Branch: master

Branches (1)
- master ✓
Tags (3)

API for extracting terms from text — Read more

https://code.developmentseed.org/extractor

This URL has Read+Write access

Merge branch 'master' of git@github.com:developmentseed/extractor

Alex Barth (author)

Thu Nov 12 07:56:56 -0800 2009

commit 66b9a9bc59bbf85e026a7b85d33fcc45e7b1225a
tree f9cc8746a1ab69222a56b51a5b1bf3a3c804b474
parent 712220e66452980f141b489bb026b8c272bac9c3 parent e821e36836dbd3a0879f70ea52e0658b74b0504c

extractor /

name	age	history message
ExtractorSimplePieParser.inc	Mon Nov 02 12:46:21 -0800 2009	New format of return value of getMappingSources(). [Alex Barth]
ExtractorSyndicationParser.inc	Mon Nov 02 12:46:21 -0800 2009	New format of return value of getMappingSources(). [Alex Barth]
README.txt	Tue Oct 20 13:37:37 -0700 2009	Initial commit of extractor to github. [yhahn]
extractor.info	Tue Oct 20 13:37:37 -0700 2009	Initial commit of extractor to github. [yhahn]
extractor.module	Tue Oct 20 13:37:37 -0700 2009	Initial commit of extractor to github. [yhahn]
libraries/	Thu Nov 12 07:55:05 -0800 2009	- Use term tid (#630594, jfreyre) - Do not look... [Alex Barth]

README.txt

$Id$
EXTRACTOR
=========
Simple term extraction API.
Usage
=====
With Feeds module:
- Install module
- Edit a feeds configuration, pick either "Common syndication parser with term
  extraction" or "SimplePie parser with term extraction".
- Go to "Mapping" settings of the processor and pick "Extracted term names" or
  "Extracted term tids" from the source drop down and map it to any target that
  handles arrays.
As API:
$terms = extractor_extract($text);

Original Source | Taken Source

developmentseed / extractor

Pledgie Donations