INDEX
Explanations
occurrences of the word "the" in various contexts
Head Attr Weights
0:0.15
1:0.03
2:0.04
3:0.07
4:0.07
5:0.05
6:0.23
7:0.02
8:0.05
9:0.16
10:0.02
11:0.04
Negative Logits
Galile   -2.82
herds    -2.79
Pony     -2.78
iTunes   -2.77
Lip      -2.75
lip      -2.61
mys      -2.56
Guest    -2.53
pony     -2.53
arg      -2.49
Positive Logits
Herald      4.66
newspaper   4.20
Guardian    3.45
Nanto       3.37
erald       3.28
papers      3.27
paper       3.16
Telegraph   3.15
newspapers  3.06
Newspaper   3.01
Activations Density 0.013%