INDEX
Explanations
references to the FX television network
New Auto-Interp
Negative Logits
Adin
-0.95
captcha
-0.70
Fas
-0.69
hood
-0.68
arians
-0.67
oys
-0.67
inated
-0.65
bell
-0.64
izational
-0.63
quartered
-0.63
POSITIVE LOGITS
FX
0.89
SW
0.89
VI
0.84
VII
0.84
III
0.83
ML
0.83
II
0.82
Fighters
0.80
FP
0.79
assi
0.78
Activations Density 0.005%