INDEX
Explanations
expressions of agreement or affirmation
New Auto-Interp
Negative Logits
EStreamFrame
-0.72
cones
-0.65
jelly
-0.64
retaliation
-0.63
cone
-0.62
shock
-0.62
fly
-0.62
restitution
-0.61
drinkers
-0.61
consolidation
-0.60
POSITIVE LOGITS
ities
0.88
aeda
0.75
sson
0.75
ilon
0.75
encer
0.75
.;
0.75
uary
0.74
ova
0.74
onica
0.73
hua
0.73
Activations Density 0.064%