The score function is an essential concept in statistics, particularly in the context of statistical estimation and likelihood theory. It is defined as the gradient, or the vector of partial derivatives, of the log-likelihood function with respect to the parameters of the distribution.
Historical Context
The concept of the score function, along with the likelihood principle, was developed and popularized by Sir Ronald A. Fisher in the early 20th century. Fisher’s work laid the foundation for modern statistical inference and estimation techniques, including the method of maximum likelihood.
Mathematical Formulation
The score function \( U(\theta) \) for a parameter \( \theta \) in a probability distribution is given by:
$$ U(\theta) = \frac{\partial}{\partial \theta} \log L(\theta; x) $$
where:
- \( L(\theta; x) \) is the likelihood function given the data \( x \).
- \( \log L(\theta; x) \) is the log-likelihood function.
For a vector of parameters \( \theta = (\theta_1, \theta_2, \ldots, \theta_k) \), the score function is the vector of partial derivatives:
$$ U(\theta) = \left( \frac{\partial \log L(\theta; x)}{\partial \theta_1}, \frac{\partial \log L(\theta; x)}{\partial \theta_2}, \ldots, \frac{\partial \log L(\theta; x)}{\partial \theta_k} \right) $$
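As a concrete illustration, the short Python sketch below compares the closed-form score of a Bernoulli model with a finite-difference approximation of the log-likelihood gradient. The function names and the central-difference step size are illustrative choices, not part of any standard API.

```python
import numpy as np

def log_likelihood(p, x):
    """Bernoulli log-likelihood: successes contribute log p, failures log(1 - p)."""
    return np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))

def score_closed_form(p, x):
    """Analytic score U(p) = d/dp log L(p; x) for the Bernoulli model."""
    return np.sum(x) / p - np.sum(1 - x) / (1 - p)

def score_numerical(p, x, h=1e-6):
    """Central-difference approximation of the same derivative."""
    return (log_likelihood(p + h, x) - log_likelihood(p - h, x)) / (2 * h)

rng = np.random.default_rng(0)
x = rng.binomial(1, 0.3, size=1000)
p = 0.25
print(score_closed_form(p, x))  # analytic gradient of the log-likelihood
print(score_numerical(p, x))    # agrees to several decimal places
```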
Key Properties
- Unbiasedness: When evaluated at the true parameter value \( \theta \), the expected value of the score function is zero:
$$ E[U(\theta)] = 0 $$
- Information: The variance of the score function equals the Fisher Information \( I(\theta) \):
$$ Var(U(\theta)) = I(\theta) $$
Fisher Information measures the amount of information that observable data carry about an unknown parameter.
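Both properties can be checked empirically. The sketch below is a minimal Monte Carlo experiment, assuming a normal model with known variance, for which the score has the closed form \( U(\mu) = \sum_i (x_i - \mu) / \sigma^2 \) and the Fisher Information is \( n / \sigma^2 \); the sample sizes and seed are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(1)
mu, sigma, n, reps = 2.0, 1.5, 50, 20_000

# Draw many independent datasets from N(mu, sigma^2) and evaluate the
# score at the TRUE parameter: U(mu) = sum(x_i - mu) / sigma^2.
samples = rng.normal(mu, sigma, size=(reps, n))
scores = (samples - mu).sum(axis=1) / sigma**2

print(scores.mean())   # ~0, illustrating E[U(theta)] = 0
print(scores.var())    # ~n / sigma^2, matching the Fisher Information
print(n / sigma**2)    # I(mu) = n / sigma^2 for this model
```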
Importance and Applicability
The score function plays a crucial role in the method of maximum likelihood estimation (MLE). The maximum likelihood estimator \( \hat{\theta} \) is found by solving the score equation:
$$ U(\hat{\theta}) = 0 $$
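When the score equation has no closed-form solution, it can be solved numerically. The sketch below assumes a Cauchy location model, whose MLE has no closed form, and finds a root of the score with scipy.optimize.brentq; the bracket around the sample median is a heuristic, since the Cauchy likelihood can be multimodal.

```python
import numpy as np
from scipy.optimize import brentq

def score(theta, x):
    """Cauchy location score: U(theta) = sum 2(x_i - theta) / (1 + (x_i - theta)^2)."""
    d = x - theta
    return np.sum(2.0 * d / (1.0 + d**2))

rng = np.random.default_rng(2)
x = rng.standard_cauchy(200) + 3.0   # true location parameter is 3.0

# No closed-form MLE exists here, so solve U(theta) = 0 numerically.
# The bracket around the sample median targets the main likelihood mode.
theta_hat = brentq(score, np.median(x) - 2.0, np.median(x) + 2.0, args=(x,))
print(theta_hat)   # close to 3.0
```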
Applications in Various Fields
- Economics and Finance: Used to estimate model parameters such as in asset pricing models, risk assessment, and economic forecasting.
- Machine Learning: Essential in training probabilistic models; optimization methods such as gradient descent follow the score when maximizing a log-likelihood (see the sketch after this list).
- Medical Research: Used in survival analysis and logistic regression models for clinical studies.
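To make the machine-learning connection concrete, the following sketch runs gradient ascent on a logistic-regression log-likelihood, where the score takes the well-known form \( X^T (y - p) \). The simulated data, learning rate, and iteration count are illustrative.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(3)
X = rng.normal(size=(500, 2))
true_beta = np.array([1.0, -2.0])
y = rng.binomial(1, sigmoid(X @ true_beta))

# Logistic-regression score: U(beta) = X^T (y - p). Gradient ascent on the
# log-likelihood moves along the (averaged) score until it is driven to zero.
beta = np.zeros(2)
learning_rate = 1.0
for _ in range(5000):
    p = sigmoid(X @ beta)
    beta += learning_rate * X.T @ (y - p) / len(y)

print(beta)   # approaches [1.0, -2.0] up to sampling noise
```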
Example
Consider a simple example of estimating the mean \( \mu \) of a normal distribution with known variance \( \sigma^2 \). The likelihood function \( L(\mu; x) \) given the data \( x = (x_1, x_2, \ldots, x_n) \) is:
$$ L(\mu; x) = \prod_{i=1}^{n} \frac{1}{\sqrt{2\pi\sigma^2}} \exp\left( -\frac{(x_i - \mu)^2}{2\sigma^2} \right) $$
The log-likelihood function is:
$$ \log L(\mu; x) = -\frac{n}{2} \log(2\pi\sigma^2) - \frac{1}{2\sigma^2} \sum_{i=1}^{n} (x_i - \mu)^2 $$
The score function with respect to \( \mu \) is:
$$ U(\mu) = \frac{\partial}{\partial \mu} \log L(\mu; x) = \frac{1}{\sigma^2} \sum_{i=1}^{n} (x_i - \mu) $$
Setting \( U(\hat{\mu}) = 0 \) yields the MLE:
$$ \hat{\mu} = \frac{1}{n} \sum_{i=1}^{n} x_i = \bar{x} $$
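This closed-form result is easy to verify numerically. The sketch below, with arbitrary simulated data, confirms that the score vanishes at the sample mean and changes sign around it.

```python
import numpy as np

rng = np.random.default_rng(4)
sigma = 2.0
x = rng.normal(5.0, sigma, size=100)

def score(mu):
    """U(mu) = sum(x_i - mu) / sigma^2 for the known-variance normal model."""
    return np.sum(x - mu) / sigma**2

mu_hat = x.mean()              # the closed-form MLE derived above
print(score(mu_hat))           # ~0 up to floating-point error
print(score(4.0), score(6.0))  # positive below mu_hat, negative above
```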
FAQs
Why is the score function important in MLE?
The maximum likelihood estimator is defined by the score equation \( U(\hat{\theta}) = 0 \): the score identifies the parameter values at which the log-likelihood is stationary.
What is the relationship between the score function and Fisher Information?
The Fisher Information is the variance of the score evaluated at the true parameter, \( I(\theta) = Var(U(\theta)) \); a larger score variance means the data are more informative about \( \theta \).
Related Terms
- Likelihood Function: The probability (or density) of the observed data, viewed as a function of the model parameters.
- Log-Likelihood Function: The natural logarithm of the likelihood function, often easier to maximize.
- Fisher Information: A measure of the amount of information that an observable random variable carries about an unknown parameter.
Famous Quotes
“The method of maximum likelihood is a method of estimation in which the estimate of the parameter of a model is that value which, under the assumed model, maximizes the likelihood function.” — Sir Ronald A. Fisher
Summary
The score function is a fundamental concept in statistical inference and MLE, representing the gradient of the log-likelihood function with respect to model parameters. It provides crucial information for parameter estimation, with wide applications in economics, finance, and various scientific fields.
By understanding the score function, its properties, and its applications, we gain deeper insights into statistical estimation methods and their broad applicability across different domains.
References
- Fisher, R.A. "The Logic of Inductive Inference." Journal of the Royal Statistical Society (1935).
- Efron, B., & Hinkley, D.V. “Assessing the accuracy of the maximum likelihood estimator: Observed versus expected Fisher Information.” Biometrika (1978).