INDEX
Explanations
instances of the word "this."
Head Attr Weights
0:0.09
1:0.08
2:0.08
3:0.10
4:0.07
5:0.09
6:0.07
7:0.09
8:0.07
9:0.08
10:0.07
11:0.07
Negative Logits
borders   -2.49
Baxter    -2.41
Mai       -2.38
pag       -2.37
forb      -2.36
Maze      -2.35
concent   -2.35
bamboo    -2.32
min       -2.27
irrig     -2.19
Positive Logits
odic          2.85
uliffe        2.81
sembly        2.61
steen         2.54
Limbaugh      2.53
Conversation  2.53
ascript       2.51
pse           2.44
Entity        2.42
zsche         2.40
Activations Density 0.000%