INDEX
Explanations
words related to historical events or the past
references to the past in relation to current events or discussions
Negative Logits
Token          Logit
starting       -0.70
Franch         -0.70
shapeshifter   -0.63
%]             -0.61
pur            -0.60
amber          -0.59
Pick           -0.58
utm            -0.57
Hi             -0.57
alert          -0.57
Positive Logits
Token          Logit
ebin           1.21
ime            1.13
tense          1.10
iche           1.05
orate          1.03
imes           1.00
decade         0.96
millennium     0.95
century        0.92
ures           0.86
Activations Density: 0.052%