INDEX
Explanations
people's names, particularly those related to legal or political contexts
Negative Logits
icles     -0.46
rx        -0.43
tails     -0.42
á½        -0.40
uate      -0.40
osc       -0.40
match     -0.39
UE        -0.39
sync      -0.39
icle      -0.39
Positive Logits
Wiggins   0.56
Manning   0.56
Bradley   0.56
Cooper    0.47
Sisters   0.42
Byrne     0.41
pigeon    0.41
Fir       0.40
McD       0.40
Kirst     0.39
Activations Density: 0.653%