INDEX
Explanations
instances of the word "dubbed" and its variations, indicating names or labels given to individuals or events
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.06
3:0.09
4:0.13
5:0.03
6:0.05
7:0.35
8:0.03
9:0.03
10:0.07
11:0.08
Negative Logits
causation
-1.53
mentioned
-1.35
Purchase
-1.34
hover
-1.34
dism
-1.30
placement
-1.30
istries
-1.28
comprehension
-1.27
istry
-1.25
navigation
-1.25
POSITIVE LOGITS
iband
1.73
ッド
1.51
Í
1.43
黒
1.37
solete
1.33
louder
1.32
eni
1.31
ikers
1.30
loud
1.29
outlaw
1.28
Activations Density 0.003%