INDEX
Explanations
the definite article "the"
New Auto-Interp
Head Attr Weights
0:0.08
1:0.07
2:0.08
3:0.08
4:0.08
5:0.08
6:0.09
7:0.08
8:0.08
9:0.08
10:0.08
11:0.07
Negative Logits
IGH
-1.68
Nightmares
-1.64
Released
-1.58
hack
-1.53
Prol
-1.51
iHUD
-1.51
Posted
-1.49
jah
-1.46
Leah
-1.42
Bett
-1.39
POSITIVE LOGITS
amorph
1.62
ween
1.58
pots
1.55
continents
1.54
cius
1.51
respectively
1.48
peror
1.48
arching
1.47
rotating
1.46
ozy
1.45
Activations Density 0.000%