INDEX
Explanations
specific names and references related to individuals, political parties or positions, and specific events
Negative Logits
»Ĵ       -0.81
å§«      -0.77
ĥ        -0.73
ĺ        -0.72
ousands  -0.70
acas     -0.69
Ħ¢       -0.69
alyses   -0.68
ĭ        -0.67
ãĥł      -0.67
Positive Logits
fault        0.93
anyway       0.80
equivalent   0.79
rather       0.77
minus        0.77
versus       0.73
consolation  0.72
disguised    0.72
reborn       0.71
Incarn       0.70
Activations Density: 2.716%