INDEX
Explanations
occurrences of the word "During" followed by various numerical values
New Auto-Interp
Negative Logits
clud
-0.16
/by
-0.15
ORIES
-0.15
erah
-0.14
ynom
-0.14
sha
-0.14
olas
-0.14
hai
-0.14
aways
-0.13
vers
-0.13
POSITIVE LOGITS
RIX
0.17
ipa
0.16
же
0.16
radu
0.15
duk
0.15
linger
0.14
gne
0.14
ombre
0.14
illac
0.14
ografia
0.13
Activations Density 0.084%