Explanations
dates and numerical references
significant dates, locations, and entities related to events or organizations
Negative Logits
assorted     -0.60
accompanied  -0.57
ラン         -0.56
horm         -0.55
Magn         -0.54
MODE         -0.52
Arkham       -0.52
little       -0.51
Minority     -0.51
onomy        -0.50
Positive Logits
nor          2.24
anymore      2.23
nor          1.52
whatsoever   1.39
unless       1.35
yet          1.31
Nor          1.17
yet          1.15
unless       1.14
except       1.09
Activations Density 0.999%