Explanations
references to historical concepts or events
NEGATIVE LOGITS
Token      Logit
omm        -0.16
tha        -0.15
onom       -0.14
Dies       -0.14
ERP        -0.13
irá        -0.13
sum        -0.13
antium     -0.13
Baker      -0.13
ораз       -0.13
POSITIVE LOGITS
Token      Logit
trace      0.34
earliest   0.33
trace      0.30
traces     0.30
traced     0.30
history    0.28
first      0.28
began      0.28
istory     0.26
history    0.26
Activations Density 0.161%