INDEX
Explanations
the word "the" along with related hypothetical or conditional phrases
Head Attr Weights
0:0.08
1:0.08
2:0.07
3:0.08
4:0.08
5:0.08
6:0.09
7:0.07
8:0.07
9:0.09
10:0.08
11:0.07
Negative Logits
atever        -2.15
onds          -2.09
pelling       -2.06
said          -2.00
ongs          -1.96
eus           -1.95
Appearances   -1.93
taboola       -1.93
screenings    -1.92
poons         -1.91
Positive Logits
Solitaire     2.50
racket        2.39
;;;;          2.37
Taj           2.36
yna           2.24
Sovere        2.23
volleyball    2.16
ratom         2.14
gymn          2.04
ply           2.03
Activations Density 0.000%