Getting Structured Data from the Internet Running Web Crawlers/Scrapers on a Big Data Production Scale /

Podrobná bibliografie
Hlavní autor: Patel, Jay M. (Autor)
Korporativní autor: SpringerLink (Online service)
Shrnutí:XIX, 397 p. 88 illus.
text
Jazyk:angličtina
Vydáno: Berkeley, CA : Apress : Imprint: Apress, 2020.
Vydání:1st ed. 2020.
Témata:
On-line přístup:https://doi.org/10.1007/978-1-4842-6576-5
Médium: Elektronický zdroj Kniha
Obsah:
  • Chapter 1: Introduction to Web Scraping
  • Chapter 2: Web Scraping in Python Using Beautiful Soup Library
  • Chapter 3: Introduction to Cloud Computing and Amazon Web Services (AWS)
  • Chapter 4: Natural Language Processing (NLP) and Text Analytics
  • Chapter 5: Relational Databases and SQL Language
  • Chapter 6: Introduction to Common Crawl Datasets
  • Chapter 7: Web Crawl Processing on Big Data Scale
  • Chapter 8: Advanced Web Crawlers
  • .