INDEX
Explanations
names of people, locations, and organizations
specific names or terms related to notable individuals or organizations
New Auto-Interp
Negative Logits
ahime
-0.81
orem
-0.77
erity
-0.65
REP
-0.61
vow
-0.58
itous
-0.58
aram
-0.58
distance
-0.58
miscarriage
-0.58
obliged
-0.58
POSITIVE LOGITS
ĩ
0.76
Kong
0.75
--+
0.74
ĵĺ
0.73
èª
0.73
ãĤ¡
0.68
Train
0.67
Walton
0.66
«
0.66
PAC
0.65
Activations Density 0.286%