INDEX
Explanations
years mentioned in sentences, indicating events or situations
New Auto-Interp
Negative Logits
£ı
-0.81
teasp
-0.80
6000
-0.77
urat
-0.76
iltr
-0.73
soDeliveryDate
-0.72
rehens
-0.71
ufficient
-0.70
vest
-0.69
edin
-0.69
POSITIVE LOGITS
starters
0.99
bidden
0.86
mankind
0.84
everyone
0.81
geries
0.81
those
0.81
gotten
0.81
humankind
0.81
taxpayers
0.80
him
0.80
Activations Density 0.150%