I am trying to understand the general logic on how to write the Likelihood Function of a Stochastic Process.
In some situations, I think I can understand how this is done. For example, suppose $X_t$ is a standard Wiener Process (https://en.wikipedia.org/wiki/Wiener_proces) with drift $\mu$ and diffusion $\sigma$:
$$ dX_t = \mu dt + \sigma dW_t $$
By definition, we know that $dW_t \sim N(0, dt)$. We also know that the difference between two terms within the Wiener Process are iid themselves. Suppose I had some real data and I believed it originated from a Wiener Process. I could first define a new random variable $Z$ based on differences of the Wiener Process at available times:
$$ Z = X_{t+1} - X_t $$ $$ X_{t+1} = X_t + \mu \Delta t + \sigma \sqrt{\Delta t} W $$
$$ Z = (X_t + \mu \Delta t + \sigma \sqrt{\Delta t} W) - X_t $$ $$ Z = \mu \Delta t + \sigma \sqrt{\Delta t} W $$
From here, we can see that $ Z \sim N(\mu \cdot \Delta t, \sigma^2 \Delta t)$. I should be able to write a Likelihood Function based on $Z$:
$$ L(\mu, \sigma^2 | Z_1, Z_2, ..., Z_n) = \prod_{i=1}^{n} \frac{1}{\sqrt{2\pi\sigma^2\Delta t}} \exp\left(-\frac{(Z_i - \mu\Delta t)^2}{2\sigma^2\Delta t}\right) $$
I want to now repeat this logic and derive the likelihood function for a Continuous Time Birth-Death Stochastic Process (https://en.wikipedia.org/wiki/Birth%E2%80%93death_process)
Here, I attempted to mathematically derive a similar likelihood function for the discrete time case (for different scenarios): Estimating Population Growth with Limited Information. I also posted here Can an Infinite Matrix be Partitioned? and Relationship Between Poisson Process and Birth and Death Process, I tried to logic this out myself.
Let $X_t$ be the state of the system at time $t$, representing the number of individuals in the population. The Birth Rate: $\lambda_n$ (rate at which the population increases from $n$ to $n+1$) and Death rate:$ \mu_n$ (rate at which the population decreases from $n$ to $n-1$.
Suppose we observe the system at time points $t_0, t_1, \ldots, t_n$ with corresponding states $X_{t_0} = x_0, X_{t_1} = x_1, \ldots, X_{t_n} = x_n$
From here, I think we can define an infinite sized Generator Matrix:
$$Q_{ij} = \begin{cases} \lambda_i & \text{if } j = i + 1 \\ \mu_i & \text{if } j = i - 1 \\ -(\lambda_i + \mu_i) & \text{if } j = i \\ 0 & \text{otherwise} \end{cases}$$
In Continuous Time Stochastic Processes, we usually use the following equation to relate the Generator Matrix to the Transition Probability Matrix:
$$P(t) = e^{Qt}$$ $$P_{ij}(\Delta t) \approx \begin{cases} \lambda_i \Delta t & \text{if } j = i + 1 \\ \mu_i \Delta t & \text{if } j = i - 1 \\ 1 - (\lambda_i + \mu_i) \Delta t & \text{if } j = i \\ 0 & \text{otherwise} \end{cases}$$
Just as we transformed the above Wiener Process into a new iid random variable, I think we can also assume independence of the Birth Death Process over small time intervals and write the likelihood as a joint product of probabilities:
$$L(\{\lambda_i\}, \{\mu_i\} | x_0, x_1, \ldots, x_n) = \prod_{k=1}^{n} P_{x_{k-1}, x_k}(t_k - t_{k-1})$$
$$L(\{\lambda_i\}, \{\mu_i\} | x_0, x_1, \ldots, x_n) \approx \prod_{k=1}^{n} \begin{cases} \lambda_{x_{k-1}} \Delta t & \text{if } x_k = x_{k-1} + 1 \\ \mu_{x_{k-1}} \Delta t & \text{if } x_k = x_{k-1} - 1 \\ 1 - (\lambda_{x_{k-1}} + \mu_{x_{k-1}}) \Delta t & \text{if } x_k = x_{k-1} \end{cases}$$
$$L(\{\lambda_i\}, \{\mu_i\} | x_0, x_1, \ldots, x_n) = \prod_{k=1}^{n} P_{x_{k-1}, x_k}(t_k - t_{k-1})$$
Where $P_{x_{k-1}, x_k}(t_k - t_{k-1})$ are the exact transition probabilities from the generator matrix exponentiation $e^{Q(t_k - t_{k-1})}$
Can someone please show me how this might be extended to the Continuous Time case?