Explanations
words related to historical events or moments in time
references to time periods in the past
Negative Logits
wagon      -0.68
rapnel     -0.67
Franch     -0.66
hat        -0.65
atism      -0.63
govtrack   -0.63
Sharp      -0.62
utters     -0.60
Gun        -0.60
/#         -0.58
Positive Logits
ebin         1.32
tense        1.05
decade       0.88
ĸļ           0.80
week         0.78
iche         0.75
orate        0.73
incarnation  0.72
month        0.71
semester     0.70
Activations Density: 0.022%