This question pertains to theData Acquisition and Preparationdomain, which in DA0-002 includes understanding data acquisition concepts and the types of data gathered from varioussources, such as web scraping. Web scraping involves extracting data from websites, and the data gathered is often semi-structured, meaning it has some organizational structure but isn’t fully relational like a database table.
JSON (Option A): JSON (JavaScript Object Notation) is a semi-structured data format commonly used in web applications. Web scraping often retrieves data in JSON format via APIs or embedded scripts, as it’s lightweight and structured with key-value pairs, making it ideal for semi-structured data.
CSV (Option B): CSV (Comma-Separated Values) is a structured format typically used for tabular data. It’s not commonly the direct output of web scraping, though scraped data might be converted to CSV later.
CSS (Option C): CSS (Cascading Style Sheets) is used for styling web pages and isn’t a data format, making it irrelevant for describing scraped data.
HTML (Option D): HTML (HyperText Markup Language) is the structure of web pages and is often the raw format scraped during web scraping. While HTML is semi-structured due to its tag-based hierarchy, it’s primarily a markup language, not a data format, and the actual data extracted is often parsed into formats like JSON.
The DA0-002 Data Acquisition and Preparation domain aligns with the DA0-001 focus on "data acquisition concepts" (web ID: 14), which includes identifying formats like JSON as semi-structured data commonly acquired through web scraping. JSON is the best fit here due to its prevalence in web data exchange.
[Reference: CompTIA Data+ DA0-002 Draft Exam Objectives, Domain 2.0 Data Acquisition and Preparation, ==============]
Contribute your Thoughts:
Chosen Answer:
This is a voting comment (?). You can switch to a simple comment. It is better to Upvote an existing comment if you don't have anything to add.
Submit