INDEX
Explanations
instances of the word "this" and related phrases
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.28
3:0.05
4:0.03
5:0.04
6:0.02
7:0.01
8:0.27
9:0.12
10:0.04
11:0.01
Negative Logits
arov
-1.28
ONSORED
-1.26
ynski
-1.25
ウス
-1.25
guessed
-1.22
ove
-1.21
efforts
-1.19
eele
-1.19
heny
-1.18
attempts
-1.15
POSITIVE LOGITS
DragonMagazine
1.33
Jungle
1.25
版
1.21
rete
1.21
Billboard
1.19
advertisement
1.18
itaire
1.17
runway
1.16
mares
1.15
Gauntlet
1.15
Activations Density 0.030%