Scrape

Structured data extraction from common web resources, using information-retrieval techniques. See the docs.

Installation

The package can be installed by adding scrape to your list of dependencies in mix.exs:

def deps do
  [
    {:scrape, "~> 3.0.0"}
  ]
end

Known Issues

This package depends on an outdated version of httpoison because of keepcosmos/readability. You can override this in your app by declaring the dependency with override: true, and everything should work.
The current version 3.X is a complete rewrite from scratch, so new issues might occur and the API has changed. When submitting an issue, please include the URL of an HTML or feed document that reproduces it, so I can look into fixing the bug.
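As a rough illustration of the httpoison override described above, the deps list in mix.exs might look like the following. The httpoison version requirement shown here is an assumption made for illustration, not something this README specifies; use whichever version your application actually needs.

```elixir
def deps do
  [
    {:scrape, "~> 3.0.0"},
    # Assumed version requirement, chosen only for illustration.
    # override: true tells Mix to use this entry instead of the
    # requirement declared by transitive dependencies.
    {:httpoison, "~> 1.8", override: true}
  ]
end
```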

Usage

Scrape.domain!(url) -> get structured data of a domain-type URL (like https://bbc.com)
Scrape.feed!(url) -> get structured data of an RSS/Atom feed
Scrape.article!(url) -> get structured data of an article-type URL
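A minimal sketch of calling one of these functions defensively. It assumes, following the usual Elixir convention for functions ending in `!`, that a failed call raises an exception; this README does not document the exact error type or the shape of the returned data, so the sketch only wraps the result. `SafeScrape`, `run/2`, and the example URL are hypothetical names introduced for illustration and are not part of the package.

```elixir
defmodule SafeScrape do
  # Hypothetical helper, not part of the scrape package.
  # Calls `scraper` (for example &Scrape.article!/1) with `url` and turns
  # any raised exception into an {:error, message} tuple.
  def run(scraper, url) when is_function(scraper, 1) and is_binary(url) do
    {:ok, scraper.(url)}
  rescue
    exception -> {:error, Exception.message(exception)}
  end
end

# Usage (requires the scrape dependency and network access):
#
#   SafeScrape.run(&Scrape.article!/1, "https://example.com/some-article")
#   |> IO.inspect()
```

Passing the scraping function as an argument keeps the helper independent of which of the three entry points is used, and makes it possible to exercise the error handling without network access.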

License

LGPLv3. You can use this package any way you want (including commercially), but I want bugfixes and improvements to flow back into this package for everyone's benefit.

