INDEX
Explanations
names that start with "Jo"
proper nouns, particularly names of individuals and teams
Negative Logits
yip       -0.69
enegger   -0.66
uously    -0.61
Circus    -0.59
uits      -0.56
uous      -0.56
pasta     -0.56
present   -0.55
fires     -0.55
spill     -0.55
Positive Logits
aku       0.83
lyn       0.66
Jr        0.66
angan     0.66
isha      0.66
emp       0.65
cott      0.65
steen     0.64
sburg     0.64
ãģĨ       0.64
Activations Density: 0.114%