Explanations
references to groups, lists, or categories of people or items
Negative Logits
oneg          -0.55
Rhestr        -0.54
ipto          -0.53
Chwiliwch     -0.52
udaler        -0.48
queous        -0.47
ilene         -0.47
zheimer       -0.46
loyees        -0.46
ondissement   -0.45
Positive Logits
other         1.10
other         1.00
autres        0.95
others        0.91
autres        0.90
others        0.89
Others        0.85
Other         0.83
Other         0.82
otras         0.80
Activations Density: 0.293%