INDEX
Explanations
occurrences of the letter 'G' in various contexts
New Auto-Interp
Negative Logits
-0.17
ktop
-0.16
amma
-0.15
erot
-0.15
drop
-0.15
aler
-0.15
aukee
-0.14
etur
-0.14
udder
-0.14
Schwartz
-0.14
POSITIVE LOGITS
lyn
0.23
Ps
0.21
last
0.19
illing
0.19
ouce
0.18
wers
0.18
rah
0.17
went
0.17
urd
0.17
affer
0.17
Activations Density 0.014%