INDEX
Explanations
references to time periods, specifically the past
references to time, especially past events
New Auto-Interp
Negative Logits
anguage
-0.81
Software
-0.74
hz
-0.72
hyde
-0.68
Services
-0.64
shapeshifter
-0.64
NEY
-0.63
igion
-0.63
witz
-0.63
horn
-0.63
POSITIVE LOGITS
ebin
1.38
tense
0.94
iche
0.90
generations
0.85
past
0.78
week
0.76
midnight
0.76
decade
0.72
ures
0.67
ime
0.67
Activations Density 0.031%