is a scientific discovery and practice that involves the collection, management, processing, analysis, visualization, and interpretation of vast amounts of heterogeneous data associated with a diverse array of scientific, translational, and inter-disciplinary applications
data science
has been considered an interdisciplinary discipline
data scientist
is a language and environment for statistical computing and graphics
R
is an object-oriented, interpreted, and interactive programming language
Python
is a programming language
SAS
is a function whose value is a real number determined by each element in the sample space.
random variable
It is a random variable that can take only countably many values; that is, its set of possible values is in one-to-one correspondence with a subset of the natural numbers.
discrete
It is a random variable that can assume any of the uncountably many values in an interval between two specific values.
continuous
is a table, a graph, or a formula listing all possible values that a discrete random variable can take on, along with the associated probabilities.
discrete probability distribution
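As a small illustration of the table form of a discrete probability distribution, the sketch below tabulates the number of heads in two tosses of a fair coin. The coin example and the names `sample_space`, `number_of_heads`, and `distribution` are assumptions chosen for illustration; they do not come from the cards themselves.

```python
from fractions import Fraction
from itertools import product

# Assumed example: sample space for two tosses of a fair coin.
sample_space = list(product("HT", repeat=2))

def number_of_heads(outcome):
    """Random variable: maps each outcome to a real number (count of heads)."""
    return outcome.count("H")

# Build the distribution table: each possible value -> its probability.
distribution = {}
for outcome in sample_space:
    value = number_of_heads(outcome)
    # Every outcome is equally likely for a fair coin.
    distribution[value] = distribution.get(value, 0) + Fraction(1, len(sample_space))

for value in sorted(distribution):
    print(value, distribution[value])
```

The probabilities in the table sum to 1, which is a quick check that every outcome in the sample space was counted.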
In an experiment of n trials, each trial has two possible outcomes: success or failure.
binomial
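A minimal sketch of the binomial probability formula, P(X = k) = C(n, k) p^k (1 - p)^(n - k). It assumes the trials are independent and share the same success probability p, which the card does not state explicitly; the function name `binomial_pmf` is hypothetical.

```python
from math import comb

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n trials.

    Assumes independent trials with a constant success probability p.
    """
    return comb(n, k) * p**k * (1 - p) ** (n - k)

# Probability of exactly 2 heads in 4 tosses of a fair coin.
print(binomial_pmf(2, 4, 0.5))  # 0.375
```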
counts the number of rare events or successes that occur in a specified time interval or region
Poisson
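A minimal sketch of the Poisson probability formula, P(X = k) = e^(-λ) λ^k / k!, where λ is the average number of events per interval or region. The function name `poisson_pmf` and the rate of 2 events per interval are assumptions made for illustration.

```python
from math import exp, factorial

def poisson_pmf(k, lam):
    """Probability of exactly k events when the average rate is lam per interval."""
    return exp(-lam) * lam**k / factorial(k)

# Assumed example: on average 2 events per interval.
for k in range(5):
    print(k, round(poisson_pmf(k, 2.0), 4))
```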
The most important of all continuous probability distributions is the normal distribution. Its graph, called the normal curve, is a bell-shaped curve.
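The bell shape can be checked numerically with the `NormalDist` class from Python's standard `statistics` module (Python 3.8 or later). This is only a sketch: the choice of the standard normal (mean 0, standard deviation 1) and the variable names are assumptions for illustration.

```python
from statistics import NormalDist

# Assumed example: the standard normal distribution.
standard_normal = NormalDist(mu=0.0, sigma=1.0)

# The curve peaks at the mean and is symmetric around it.
peak = standard_normal.pdf(0.0)
left = standard_normal.pdf(-1.0)
right = standard_normal.pdf(1.0)
print(peak > left, left == right)

# Area under the curve within one standard deviation of the mean
# (roughly 68 percent of the total probability).
within_one_sd = standard_normal.cdf(1.0) - standard_normal.cdf(-1.0)
print(round(within_one_sd, 2))
```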