INDEX
Explanations
terms related to historical events and organizations, especially those with specific abbreviations or acronyms
references to geographic or organizational designations
Negative Logits
Franch     -0.73
idates     -0.72
uate       -0.71
wagen      -0.71
Nig        -0.70
fitting    -0.70
utive      -0.69
uated      -0.67
holders    -0.66
ulator     -0.66
Positive Logits
WW         1.23
II         1.13
III        1.13
JD         1.09
WF         1.05
LD         1.04
MJ         1.03
BF         0.99
OA         0.99
TW         0.97
Activations Density 0.020%