INDEX
Explanations
phrases related to individuals named "Jo" followed by a number
Negative Logits
Token            Logit
flies            -0.68
interests        -0.67
IFIED            -0.67
Fired            -0.65
BALL             -0.64
raits            -0.64
IMAGES           -0.62
Pwr              -0.62
amplification    -0.62
CLASS            -0.62
Positive Logits
Token            Logit
aquin            1.40
Jo               1.23
ining            1.08
zeb              1.07
anne             1.07
jo               1.06
zy               1.01
ppa              1.01
areth            0.97
Anne             0.96
Activations Density: 0.018%