Next: Statistical Plots, Previous: Descriptive Statistics, Up: Statistics
Octave also supports various helpful statistical functions.
Return the Mahalanobis' D-square distance between the multivariate samples x and y, which must have the same number of components (columns), but may have a different number of observations (rows).
If x is a vector, subtract its mean. If x is a matrix, do the above for each column. If the optional argument dim is given, perform the above operation along this dimension
If x is a vector, subtract its mean and divide by its standard deviation.
If x is a matrix, do the above along the first non-singleton dimension. If the optional argument dim is given then operate along this dimension.
Compute the binomial coefficient or all combinations of n. If n is a scalar then, calculate the binomial coefficient of n and k, defined as
/ \ | n | n (n-1) (n-2) ... (n-k+1) n! | | = ------------------------- = --------- | k | k! k! (n-k)! \ /If n is a vector generate all combinations of the elements of n, taken k at a time, one row per combination. The resulting c has size
[nchoosek (length (n),k),k].See also: bincoeff.
Generate all permutations of v, one row per permutation. The result has size
factorial (n) *n, where n is the length of v.As an example,
perms([1, 2, 3])returns the matrix1 2 3 2 1 3 1 3 2 2 3 1 3 1 2 3 2 1
Return the different values in a column vector, arranged in ascending order.
As an example,
values([1, 2, 3, 1])returns the vector[1, 2, 3].
Create a contingency table t from data vectors. The l vectors are the corresponding levels.
Currently, only 1- and 2-dimensional tables are supported.
Compute Spearman's rank correlation coefficient rho for each of the variables specified by the input arguments.
For matrices, each row is an observation and each column a variable; vectors are always observations and may be row or column vectors.
spearman (x)is equivalent tospearman (x,x).For two data vectors x and y, Spearman's rho is the correlation of the ranks of x and y.
If x and y are drawn from independent distributions, rho has zero mean and variance
1 / (n - 1), and is asymptotically normally distributed.
Count the upward runs along the first non-singleton dimension of x of length 1, 2, ..., n-1 and greater than or equal to n. If the optional argument dim is given operate along this dimension
If x is a vector, return the (column) vector of ranks of x adjusted for ties.
If x is a matrix, do the above for along the first non-singleton dimension. If the optional argument dim is given, operate along this dimension.
If x is a vector, return the range, i.e., the difference between the maximum and the minimum, of the input data.
If x is a matrix, do the above for each column of x.
If the optional argument dim is supplied, work along dimension dim.
For each component of p, return the probit (the quantile of the standard normal distribution) of p.
For each component of p, return the logit of p defined as
logit(p) = log (p / (1-p))
Return the complementary log-log function of x, defined as
cloglog(x) = - log (- log (x))
Compute Kendall's tau for each of the variables specified by the input arguments.
For matrices, each row is an observation and each column a variable; vectors are always observations and may be row or column vectors.
kendall (x)is equivalent tokendall (x,x).For two data vectors x, y of common length n, Kendall's tau is the correlation of the signs of all rank differences of x and y; i.e., if both x and y have distinct entries, then
1 tau = ------- SUM sign (q(i) - q(j)) * sign (r(i) - r(j)) n (n-1) i,jin which the q(i) and r(i) are the ranks of x and y, respectively.
If x and y are drawn from independent distributions, Kendall's tau is asymptotically normal with mean 0 and variance
(2 * (2n+5)) / (9 *n* (n-1)).
If x is a vector, return the interquartile range, i.e., the difference between the upper and lower quartile, of the input data.
If x is a matrix, do the above for first non singleton dimension of x. If the option dim argument is given, then operate along this dimension.
Create categorical data out of numerical or continuous data by cutting into intervals.
If breaks is a scalar, the data is cut into that many equal-width intervals. If breaks is a vector of break points, the category has
length (breaks) - 1groups.The returned value is a vector of the same size as x telling which group each point in x belongs to. Groups are labelled from 1 to the number of groups; points outside the range of breaks are labelled by
NaN.