INDEX
Explanations
references to historical figures with the surname "Roosevelt"
references to President Roosevelt and his administration
Negative Logits
ateurs      -0.78
raints      -0.77
LESS        -0.69
ATIVE       -0.68
ning        -0.68
uating      -0.68
las         -0.67
alities     -0.67
RED         -0.66
rav         -0.65
Positive Logits
Roosevelt   1.17
velt        1.00
enthal      0.90
iets        0.82
hower       0.79
Doctrine    0.77
appoint     0.77
dinand      0.76
enstein     0.76
ufact       0.75
Activations Density: 0.019%