INDEX
Explanations
references to the current time or present tense indicators
New Auto-Interp
Negative Logits
andex
-0.15
nze
-0.15
одо
-0.15
бо
-0.14
vik
-0.14
erva
-0.14
inator
-0.14
elo
-0.14
ARP
-0.14
uous
-0.13
POSITIVE LOGITS
ECC
0.15
Fry
0.14
itel
0.14
objective
0.14
ixel
0.13
iring
0.13
egl
0.13
Fan
0.13
sdk
0.13
infl
0.13
Activations Density 0.014%