INDEX
Explanations
mentions of the number "19" or related temporal references
New Auto-Interp
Negative Logits
anguage
-0.78
orc
-0.78
aminer
-0.77
ensional
-0.71
ovie
-0.71
iated
-0.70
heed
-0.70
oaded
-0.66
Uriel
-0.64
pora
-0.63
POSITIVE LOGITS
th
1.09
âĸĪâĸĪ
0.99
03
0.92
08
0.91
059
0.91
05
0.90
09
0.90
06
0.89
07
0.89
61
0.89
Activations Density 0.019%