As already hinted by Dean Miller, the two definitions are not only equivalent - they're exactly the same definition with a different phrasing. This becomes evident once you unpack these definitions.
$\newcommand{\pl}{\partial}$
What does it mean for $u \in L^p$ to have a weak derivative in $L^p$?
Well, the definition is: there are functions $\pl_i u \in L^p$ ($i = 1,2,\ldots,n$) satisfying
$$
\int \pl_i u \cdot \varphi = - \int u \cdot \pl_i \varphi
\quad \text{for } \varphi \in C_c^\infty.
$$
(note that $\pl_i \varphi$ is the classical derivative, while $\pl_i u$ slightly abuses the notation - one has to understand that it's just the weak derivative, and that it's unique)
What does it mean for $u \in L^p$ to have a distributional derivative in $L^p$?
The only difference here is that you don't need more assumptions on $u$ to make sense of the distributional derivative. By definition, the $i$-th partial distributional derivative is the linear operator $D_i u$ given by
$$
D_i u [\varphi] := - \int u \cdot \pl_i \varphi
\quad \text{for } \varphi \in C_c^\infty.
$$
Now, $D_i$ a linear operator, so what do we mean when we say "it's in $L^p$"? Actually, it's another abuse of language, and we actually mean that "$D_i u$ can be represented by an $L^p$ function", or more precisely, there is a function $\pl_i u \in L^p$ satisfying
$$
D_i u [\varphi] = \int \pl_i u \cdot \varphi
\quad \text{for } \varphi \in C_c^\infty.
$$
(it's not uncommon to see even more notational abuse here, with $\pl_i u$ denoting both the distribution and the related function)
As you can see, it's one and the same thing. The notions of distributions and distributional derivatives give us a general framework with very little assumptions, which is sometimes handy. But if you're given some particular PDE, you only consider derivatives up to some order, and typically you also need a complete space in order for standard existence methods to work - that's where Sobolev spaces come in. If your PDE is not linear (or with rough coefficients), then you have to assume these derivatives are represented by functions, anyway. Thus, people most often talk about weak derivatives, even if it's the same thing in the end.