INDEX
Explanations
references to specific time periods or events mentioned in a document
occurrences of the word "the" and phrases indicating time or duration
Negative Logits
meal      -0.68
arth      -0.67
ragon     -0.67
marine    -0.64
Pad       -0.64
Cho       -0.63
Sax       -0.62
won       -0.61
egu       -0.61
arde      -0.60
Positive Logits
sake      1.45
purposes  1.19
reasons   1.01
icion     0.86
meantime  0.85
example   0.85
instance  0.77
curious   0.73
Reasons   0.73
unin      0.71
Activations Density 0.095%