Explanations
repeated mentions of the word "as" and variations of the verb "to be"
Head Attr Weights
0:0.03
1:0.02
2:0.05
3:0.06
4:0.05
5:0.04
6:0.39
7:0.04
8:0.06
9:0.08
10:0.07
11:0.05
Negative Logits
innocence       -1.39
blackout        -1.34
remission       -1.22
landfall        -1.21
hover           -1.20
Haram           -1.20
breastfeeding   -1.19
dehydration     -1.18
tampering       -1.18
payment         -1.17
Positive Logits
anium           1.98
️               1.68
actionDate      1.62
inki            1.56
iverse          1.51
rison           1.50
emark           1.49
ゼ              1.41
initialized     1.41
enhagen         1.36
Activations Density 0.010%