INDEX
Explanations
references to the name "Mansour."
proper nouns, specifically names related to individuals and places
New Auto-Interp
Negative Logits
istically
-0.67
dism
-0.64
pta
-0.62
atton
-0.61
OPLE
-0.59
ISM
-0.58
Äĩ
-0.58
èª
-0.57
tis
-0.56
Gears
-0.56
POSITIVE LOGITS
laughter
1.30
pread
1.20
creen
1.07
hap
1.07
cape
0.95
plain
0.91
hiro
0.91
sein
0.90
ensibly
0.90
ultan
0.88
Activations Density 0.093%