INDEX
Explanations
numbers associated with rarity or frequency
intensifying adverbs that emphasize a degree of quality or state
New Auto-Interp
Negative Logits
osal
-0.73
phyl
-0.72
arians
-0.68
åĤ
-0.68
æ©
-0.67
ierre
-0.67
gent
-0.66
coh
-0.64
çĦ
-0.63
à
-0.62
POSITIVE LOGITS
Again
0.89
Helpful
0.88
Enough
0.85
Rating
0.84
Likely
0.84
Hits
0.83
Leader
0.83
Funny
0.82
Advice
0.82
Fine
0.81
Activations Density 0.029%