INDEX
Explanations
the word "again"
instances of the word "agin" and related variations
New Auto-Interp
Negative Logits
ding
-0.78
iffe
-0.75
theless
-0.69
fax
-0.66
jar
-0.66
header
-0.66
grading
-0.64
fer
-0.64
izen
-0.63
ded
-0.63
POSITIVE LOGITS
amus
1.12
agin
0.89
uments
0.89
================================================================
0.85
azine
0.80
entials
0.78
ement
0.76
aceae
0.76
insula
0.76
amaz
0.76
Activations Density 0.031%