Skip to main content

Datasets, Data Mining, and Process Mining

Dataset is a data collection, mostly in some database format.

  • .csv format
  • where to get datasets
  • data scraping
  • cleaning and repairing datasets
  • data frames and its flavors

Using PHP and MySQL

PHP (Hypertext Preprocessor) is server-side scripting programming language and it is used for interacting with MySQL databases.

  • what we can use it for
  • data science applications
  • updating and writing data

Data Mining Techniques

  • Big Data
  • processing and cleaning
  • methods

Web scraping

Collecting data from web sources is called web scraping.

We can use python for HTML parsing and then clean the data.

  • BeautifulSoup

In Anaconda prompt, install BeautifulSoup:

conda install beautifulsoup4
  • Scrape the webpages!
  • Clean the data

Creating Data Frames

  • on-premises and cloud data platforms

Process Mining

  • insights on how things work
note

Goals.

info

Goals.

caution

Goals.

danger

Goals.

Use tabs in admonitions
This is an apple 🍎
npm run build
docs/hello.md
# Hello

This is my **first Docusaurus document**!
# Hello

This is my **first Docusaurus document**!
CLICK ME

yes, even hidden code blocks!

chaptertest

print("hello world!")

Em Dash (—) En Dash (–)

Ctrl+Alt+0

Soubor ‣ Nový ‣ Obecné. (File ‣ New ‣ General)