INDEX
Explanations
proper names involving the name "Sam" or variations thereof
occurrences of the substring "sam" in various contexts
New Auto-Interp
Negative Logits
ruary
-0.79
enegger
-0.78
Reviewer
-0.69
ãģ®ç
-0.67
Engels
-0.65
thinkable
-0.64
lished
-0.63
Posted
-0.63
fundament
-0.62
arrang
-0.60
POSITIVE LOGITS
udi
0.85
inki
0.71
opol
0.70
Pok
0.70
asin
0.69
onite
0.67
Kov
0.67
itans
0.66
uku
0.66
owitz
0.65
Activations Density 0.123%