We can observe that the frequencies decrease fairly rapidly even in the case of the log frequencies. Thus some of the syllables (words) are used very often while a majority of them are used sparsely.
The size parameters are,
Number of syllables : 16604
Number of Words: 208970
Let's plot the whole plot till 10000 ranks [Here (LOG_freq)] and [Here (Normal_freq)], then if we observe closely we can notice that the area under the curve for the first part (Point till the 0.1* max_freq) and the rest of the part is roughly equal.
Sum_P1 = 3867153
Sum_P2 = 2980027
Sum_P1/Sum_P2 = 1.29
(Sum refers to the sum of the frequencies, and consequently the area under the curve)
This effect is referred to as long tail effect and is common in plots related to purchases of products. So we can also conclude that majority of interactions are formed by less used words.
The following are the plots for normal frequencies (instead of log)