INDEX
Explanations
years mentioned in sentences
references to specific years or chronological events
New Auto-Interp
Negative Logits
Otherwise
-0.73
arah
-0.72
apest
-0.68
ãĥ¥
-0.68
ãĤ¦ãĤ¹
-0.68
ãĤ§
-0.67
////////////////
-0.67
ãĤ´ãĥ³
-0.63
]);
-0.62
alogy
-0.61
POSITIVE LOGITS
however
0.90
when
0.86
when
0.82
shortly
0.82
Newsweek
0.81
Forbes
0.78
upon
0.77
amid
0.77
while
0.75
according
0.71
Activations Density 0.165%