INDEX
Explanations
proper nouns or names, specifically those related to "Jose"
references to the name "Jose."
New Auto-Interp
Negative Logits
ãĤ¤ãĥĪ
-0.79
ICLE
-0.75
Unicorn
-0.73
Els
-0.72
wealth
-0.68
女
-0.67
åĤ
-0.67
ymph
-0.66
ãĥ¤
-0.64
ickle
-0.64
POSITIVE LOGITS
lins
0.87
upe
0.84
Jose
0.83
bers
0.83
ber
0.81
keley
0.76
Francisco
0.76
ppa
0.76
Earthqu
0.75
ppers
0.74
Activations Density 0.007%