INDEX
Explanations
phrases related to politics and political figures
Negative Logits
clitor        -0.79
pigeon        -0.76
reflex        -0.70
peripheral    -0.69
lapse         -0.68
subp          -0.67
territorial   -0.67
mounts        -0.66
avenues       -0.66
trail         -0.65
Positive Logits
ï¸ı           1.47
ternity       1.04
ï¸            0.91
there         0.91
âĶĢâĶĢ        0.91
uthor         0.89
amily         0.88
then          0.84
ł             0.82
conom         0.81
Activations Density 0.369%