INDEX
Explanations
references to a specific name or person, particularly "Ivanka."
New Auto-Interp
Negative Logits
Clicker
-0.83
stere
-0.78
puzz
-0.76
Camer
-0.70
rapt
-0.70
Clim
-0.70
hig
-0.69
Kling
-0.68
Wheel
-0.68
Hallow
-0.67
POSITIVE LOGITS
anka
2.35
ANA
2.15
ña
2.12
anya
2.06
aja
2.05
enna
1.81
itta
1.70
acha
1.68
apa
1.68
vana
1.64
Activations Density 0.027%