INDEX
Explanations
lists of people's names
New Auto-Interp
Negative Logits
=\"
-0.71
institution
-0.68
wings
-0.66
endeavor
-0.66
existence
-0.63
economy
-0.62
parts
-0.61
opia
-0.61
ballpark
-0.61
vation
-0.61
POSITIVE LOGITS
Jr
1.09
Raphael
1.04
Geoff
1.02
Isabel
0.99
Gerald
0.99
Shant
0.98
Maurice
0.98
Katherine
0.97
Richie
0.97
Samuel
0.97
Activations Density 0.109%