Explanations
names starting with "Jo"
the name "Jo" across various contexts
Negative Logits
difference    -0.65
rified        -0.62
Greeks        -0.62
vectors       -0.62
binary        -0.61
Iranians      -0.61
flies         -0.61
Europeans     -0.60
Horizons      -0.60
bombard       -0.59
Positive Logits
jo            1.53
aquin         1.16
zzo           1.05
JO            1.00
Jo            0.98
vernment      0.98
atana         0.96
ffee          0.95
ichi          0.94
itsu          0.89
Activations Density 0.007%