Explanations
references to media outlets and institutions
Negative Logits
ãĤ°         -0.62
ãĥ£         -0.55
deserted    -0.54
looph       -0.52
vex         -0.51
throb       -0.51
deduct      -0.51
hump        -0.51
wakes       -0.51
AND         -0.50
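The first two negative-logit tokens look like byte-level BPE vocabulary strings rather than encoding damage. The sketch below shows one way to map such strings back to readable text. It assumes a GPT-2-style byte-to-unicode table; the source does not say which tokenizer produced these tokens, and the helper name `decode_token` is hypothetical.

```python
def bytes_to_unicode():
    """Build a GPT-2-style map from byte values to printable characters.

    Assumption: the dashboard's tokenizer uses this byte-level scheme.
    """
    # Bytes that are already printable keep their own code point.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(ord("¡"), ord("¬") + 1))
          + list(range(ord("®"), ord("ÿ") + 1)))
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to code point 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(bs, cs)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Hypothetical helper: turn a vocabulary string back into UTF-8 text."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # A single token may hold only part of a character, so replace on error.
    return raw.decode("utf-8", errors="replace")


tokens = [
    "\u00e3\u0124\u00b0",  # displayed in the list as ãĤ°
    "\u00e3\u0125\u00a3",  # displayed in the list as ãĥ£
    "deserted",
]
for tok in tokens:
    print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption the two strings decode to the katakana グ and ャ, while plain ASCII tokens such as `deserted` pass through unchanged.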
Positive Logits
respectively    1.67
alike           1.09
+.              0.88
attRot          0.83
LLP             0.81
.).             0.74
*.              0.74
.[              0.73
)).             0.71
versa           0.70
Activations Density 0.467%