INDEX
Explanations
proper nouns, specifically "Mas" with different variations
mentions of the name "Mas" or variations related to it
Negative Logits
Token     Logit
sburgh    -0.95
ship      -0.76
ski       -0.73
orer      -0.71
ï¸ı       -0.70
OUGH      -0.65
ACTED     -0.64
BOOK      -0.64
Ames      -0.63
ships     -0.63
Positive Logits
Token     Logit
quer      1.07
ques      1.03
cul       1.00
seys      0.97
dar       0.96
iple      0.95
Mas       0.92
querade   0.92
sey       0.90
idi       0.89
Activations Density: 0.006%
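
To make the density figure concrete, here is a minimal sketch of how such a value could be computed. It assumes that "density" means the fraction of sampled token positions on which the feature's activation is above zero; the page itself does not define the term. The function name `activation_density` and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density = (active positions) / (all sampled positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 6 of which activate.
sample = [0.0] * 100_000
for i in (10, 2_000, 30_000, 45_000, 70_000, 99_999):
    sample[i] = 1.5

density = activation_density(sample)
print(f"{density * 100:.3f}%")
```

Under that assumption, a density of 0.006% corresponds to roughly 6 active positions per 100,000 tokens, so the feature fires very rarely.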