I am a non-student working through the first edition of Yates and Goodman's text, Probability and Stochastic Processes. On page $115$, question $3.6.9$ goes like this:
Each millisecond at a telephone switch, a call independently arrives with probability $p$. Each call is either a data call $(d)$ with probability $q$ or a voice call $(v)$. Each data is a fax call with probability $r$. Let $N$ equal the number of milliseconds required to observe the first $100$ fax calls. Let $T$ equal the number of milliseconds you observe the switch waiting for the first fax call. Find the marginal PMF $P_T(t)$ and the conditional PMF $P_{N|T}(n,t)$. Lastly, find the conditional PMF $P_{T|N}(t,n).$
Now, I seem to have two events with a random chance of happening: getting a call, and then having that call specifically yield a fax call.
From the perspective of getting a fax call (given getting a call), I don't think the PMF is difficult to figure out: $$\text{PMF} =\begin{cases} r(1-r)^{t-1},\quad t=1,2,... \\ 0,\quad \text{otherwise} \end{cases}$$ Of course, now we have to consider that we might not get a phone call altogether, and here is where my (personal) first question enters:
(a) If we have two (or more!) random variables, nested within each other (in this context, a getting a fax call assuming that we've gotten a call), how do we factor in the second variable into the PMF? My initial guess was this:
$$\text{PMF} =\begin{cases} q \cdot r(1-r)^{t-1},\quad t=1,2,... \\ 0,\quad \text{otherwise} \end{cases},$$ essentially multiplying the first PMF I showed by the probability of getting a call at all. My reasoning is that the first PMF (with only factor $r$) that I showed is inherently the probability given that a call was received; multiply that by the probability that a call was received, and you should get $P_T(t)$. However, should I be including a $(1-q)$ term in there? If so, to what power do I raise it to? It can't be $t-1$.
(b) How would I go about computing the joint PMF $P_{N,T}(n,t)$? This is necessary, as I understand it, for calculating the conditional PMFs. My guess is that $T$ is geometrically distributed, and $N$ is a Negative Binomial/"Pascal" distributed random variable.