Package: polite 0.1.4

polite: Be Nice on the Web
Be responsible when scraping data from websites by following polite principles: introduce yourself, ask for permission, take slowly and never ask twice.
Authors:
polite_0.1.4.tar.gz
polite_0.1.4.zip(r-4.7)polite_0.1.4.zip(r-4.6)polite_0.1.4.zip(r-4.5)
polite_0.1.4.tgz(r-4.6-any)polite_0.1.4.tgz(r-4.5-any)
polite_0.1.4.tar.gz(r-4.7-any)polite_0.1.4.tar.gz(r-4.6-any)
polite_0.1.4.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
polite/json (API)
NEWS
| # Install 'polite' in R: |
| install.packages('polite', repos = c('https://dmi3kno.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/dmi3kno/polite/issues
Pkgdown/docs site:https://dmi3kno.github.io
crawlermemoiserate-limiterrobotstxtrvestscraperwebscraping
Last updated from:11aa11a674. Checks:9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 131 | ||
| source / vignettes | OK | 182 | ||
| linux-release-x86_64 | OK | 125 | ||
| macos-release-arm64 | OK | 95 | ||
| macos-oldrel-arm64 | OK | 91 | ||
| windows-devel | OK | 80 | ||
| windows-release | OK | 88 | ||
| windows-oldrel | OK | 73 | ||
| wasm-release | OK | 110 |
Exports:%>%bowguess_basenamehtml_attrs_dfris.politenodpolitelyripscrapeset_rip_delayset_scrape_delayuse_manners
Dependencies:askpassassertthatcachemclicliprcodetoolscrayoncredentialscurldescdigestfastmapfsfuturefuture.applygertghgitcredsglobalsgluehttrhttr2inijsonlitelifecyclelistenvmagrittrmemoisemimeopensslparallellypillarpkgconfigpurrrR6rappdirsratelimitrRcpprlangrobotstxtrprojrootrstudioapirvestselectrspiderbarstringistringrsystibbleusethisutf8vctrswhiskerwithrxml2yamlzip
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Introduce yourself to the host | bow is.polite |
| Guess download file name from the URL | guess_basename |
| Convert collection of html nodes into data frame | html_attrs_dfr |
| Agree modification of session path with the host | nod |
| Give your web-scraping function good manners polite | politely |
| Print host introduction object | print.polite |
| Polite file download | rip |
| Scrape the content of authorized page/API | scrape |
| Reset scraping/ripping rate limit | set_rip_delay set_scrape_delay |
| Use manners in your own package or script | use_manners |
