INDEX
Explanations
names or words containing the sequence "hn" in them
names of people or characters, particularly surnames
New Auto-Interp
Negative Logits
âĹ¼
-0.82
accompan
-0.73
Yemeni
-0.66
Premium
-0.65
bearer
-0.65
Pokemon
-0.65
Avalanche
-0.65
Primal
-0.64
Maximum
-0.64
drawn
-0.63
POSITIVE LOGITS
hn
1.43
agar
1.10
eman
1.05
swer
0.92
hm
0.90
quist
0.90
avy
0.88
asy
0.88
hak
0.88
hr
0.87
Activations Density 0.004%