INDEX
Explanations
Roosevelt, FDR, Theodore, Franklin
New Auto-Interp
Negative Logits
甕
0.40
nigeria
0.38
BAY
0.38
Ambris
0.38
烏
0.37
셍
0.37
cơn
0.37
ULA
0.36
flutter
0.36
niger
0.36
POSITIVE LOGITS
Roosevelt
2.13
FDR
1.77
Franklin
1.44
osevelt
1.44
Franklin
1.43
Teddy
1.39
Churchill
1.34
Teddy
1.32
Theodore
1.23
Eisenhower
1.20
Activations Density 0.006%