INDEX
Explanations
numerical data related to events and statistics
New Auto-Interp
Negative Logits
unaff
-0.66
verages
-0.65
unsur
-0.63
iola
-0.62
ardent
-0.61
Elect
-0.60
chest
-0.60
collaborated
-0.60
endangered
-0.58
spons
-0.58
POSITIVE LOGITS
<+
0.78
Dialogue
0.73
1969
0.71
sol
0.70
UID
0.69
Timeout
0.69
SAY
0.69
audio
0.66
hower
0.66
ewitness
0.66
Activations Density 0.043%