INDEX
Explanations
proper nouns
references to the word "gan" in various contexts
New Auto-Interp
Negative Logits
differential
-0.71
Adin
-0.67
PER
-0.64
mates
-0.63
Bravo
-0.62
İĭ
-0.62
ractive
-0.61
ension
-0.60
acter
-0.59
ention
-0.58
POSITIVE LOGITS
igans
1.10
culosis
0.99
igan
0.94
ciating
0.91
gha
0.88
isbury
0.87
cest
0.87
gan
0.87
ãĥĥãĤ¯
0.86
aghan
0.84
Activations Density 0.012%