DATA MINING

We can help you with:

  • Data mining (download, cataloging, cleanup, quality control) and other data organization or automated pre-processing workflows on open data repositories

  • Creating advanced and functional Web-Based Geoscience Data Repositories

We do the dirty work, such as:

  • Crawl & Scrape

  • Automated extraction of geoscience data from the web and open-source datasets, including but not limited to the following formats:

    • LAS, DLIS

    • SEG-Y

    • Documents, including DOCX, XLSX, PPTX, TXT

    • Images, including BMP, JPEG

    • Metadata

    • Papers

    • ZIP archives, etc.

  • Batch cleaning and sorting of online and offline datasets

  • Database Manipulation

    • CRUD (Create, Read, Update & Delete)

    • DAVE (Delete, Add, View & Edit)
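To make the cataloging and CRUD items above more concrete, here is a minimal Python sketch, not a description of our production tooling. It classifies files by extension into the format groups listed above and stores them in a small SQLite catalog. The table layout, the category names, the "raw" and "cleaned" status values, and all function names are assumptions chosen for illustration.

```python
import sqlite3
from pathlib import Path

# Hypothetical mapping from file extension to catalog category,
# based on the formats listed above.
FORMAT_CATEGORIES = {
    ".las": "well log", ".dlis": "well log",
    ".sgy": "seismic", ".segy": "seismic",
    ".docx": "document", ".xlsx": "document",
    ".pptx": "document", ".txt": "document",
    ".bmp": "image", ".jpg": "image", ".jpeg": "image",
    ".zip": "archive",
}


def classify(filename: str) -> str:
    """Return the catalog category for a file name, or 'other' if unknown."""
    return FORMAT_CATEGORIES.get(Path(filename).suffix.lower(), "other")


def open_catalog(db_path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the catalog database with an assumed schema."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS files ("
        "name TEXT PRIMARY KEY, category TEXT, status TEXT)"
    )
    return conn


def create_entry(conn: sqlite3.Connection, name: str) -> None:
    """Create: register a newly downloaded file as 'raw'."""
    conn.execute(
        "INSERT INTO files VALUES (?, ?, ?)", (name, classify(name), "raw")
    )


def read_entry(conn: sqlite3.Connection, name: str):
    """Read: return (name, category, status), or None if not cataloged."""
    return conn.execute(
        "SELECT name, category, status FROM files WHERE name = ?", (name,)
    ).fetchone()


def update_status(conn: sqlite3.Connection, name: str, status: str) -> None:
    """Update: record a new processing status, e.g. 'cleaned'."""
    conn.execute("UPDATE files SET status = ? WHERE name = ?", (status, name))


def delete_entry(conn: sqlite3.Connection, name: str) -> None:
    """Delete: remove a file from the catalog."""
    conn.execute("DELETE FROM files WHERE name = ?", (name,))
```

A typical sequence would be to call open_catalog() once, create_entry() for each downloaded file, and update_status() as each file passes cleanup and quality control. The DAVE operations map onto the same four functions under different names.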

 

(Left) A word cloud of the most frequently occurring relevant terms gives an overview of the document at a glance. (Right) A histogram of the extracted terms identified as related to lithology.
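The term counts behind a word cloud and a lithology histogram like these can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the lithology vocabulary below is a hypothetical placeholder (a real project would use a curated dictionary), and the simple tokenizer ignores multi-word terms.

```python
import re
from collections import Counter

# Hypothetical vocabulary used only for this illustration.
LITHOLOGY_TERMS = {
    "sandstone", "shale", "limestone", "dolomite", "siltstone", "claystone",
}


def term_counts(text: str) -> Counter:
    """Count every alphabetic word in the text, case-insensitively."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def lithology_counts(text: str) -> dict:
    """Keep only the counts of terms found in the lithology vocabulary."""
    counts = term_counts(text)
    return {t: counts[t] for t in sorted(LITHOLOGY_TERMS) if counts[t] > 0}
```

The full counts from term_counts() would feed the word cloud, while the filtered result from lithology_counts() gives the bar heights for the histogram.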