INDEX
Explanations
dates and numbers mentioned in the text
occurrences of the number 11, especially in a historical or numerical context
New Auto-Interp
Negative Logits
PLIC
-0.66
è¯
-0.65
elo
-0.65
conduc
-0.63
meric
-0.62
conflic
-0.61
flows
-0.60
sers
-0.60
bler
-0.58
ãĥ¼ãĤ¯
-0.58
POSITIVE LOGITS
11
2.88
12
2.15
13
2.09
14
1.94
10
1.93
eleven
1.89
9
1.81
17
1.81
21
1.77
16
1.77
Activations Density 0.034%