INDEX
Explanations
instances of the word "this" in various contexts
New Auto-Interp
Head Attr Weights
0:0.03
1:0.02
2:0.11
3:0.06
4:0.18
5:0.03
6:0.11
7:0.17
8:0.04
9:0.05
10:0.07
11:0.07
Negative Logits
Bridges
-1.43
accompanied
-1.36
acht
-1.32
devoted
-1.29
Publisher
-1.29
naire
-1.26
Achievement
-1.23
AMA
-1.22
Transparency
-1.22
adian
-1.21
POSITIVE LOGITS
opolis
1.55
umph
1.46
regenerate
1.41
onga
1.40
fray
1.40
pse
1.37
iframe
1.35
ople
1.35
inus
1.34
reckoning
1.33
Activations Density 0.001%