INDEX
Explanations
expressions of empathy or sharing announcements
New Auto-Interp
Negative Logits
mishand
-0.61
sbm
-0.61
mist
-0.61
clashed
-0.60
Vog
-0.59
outnumbered
-0.59
Leh
-0.59
Created
-0.59
hull
-0.58
alties
-0.57
POSITIVE LOGITS
myself
1.09
congratulations
0.86
disclaimer
0.84
thank
0.81
ourselves
0.79
THANK
0.79
ASAP
0.79
:]
0.78
tonight
0.78
my
0.77
Activations Density 0.245%