Explanations
words related to statements or quotations
phrases that express opinions or describe conditions about a subject
Negative Logits
Token       Logit
luaj        -0.92
edIn        -0.71
Versions    -0.66
aired       -0.65
bies        -0.63
arthed      -0.62
lat         -0.62
%%          -0.61
andise      -0.61
resp        -0.61
Positive Logits
Token          Logit
definitely     1.08
gonna          0.97
gotta          0.94
bitters        0.94
basically      0.90
certainly      0.89
unbelievable   0.88
frustrating    0.84
honestly       0.84
obviously      0.83
Activations Density: 0.281%