Explanations
references to time, specifically focusing on the word "recent" and its variations
Negative Logits
ãģ°        -0.17
ujet       -0.15
ÄŁan       -0.14
avir       -0.14
raison     -0.14
orra       -0.14
аков       -0.14
elsing     -0.14
\<^        -0.14
ual        -0.13
Positive Logits
imes       0.17
/current   0.16
lately     0.16
ighbor     0.16
zos        0.15
iembre     0.15
ìĶ©        0.15
ismet      0.15
ifle       0.14
-built     0.14
Activation Density: 0.022%