INDEX
Explanations
words related to time and continuous action
New Auto-Interp
Negative Logits
pour
-0.78
puff
-0.66
Desc
-0.66
eny
-0.65
eri
-0.65
esta
-0.64
VP
-0.64
VG
-0.64
Diary
-0.64
cart
-0.63
POSITIVE LOGITS
proven
0.87
adulthood
0.78
AFTER
0.74
soever
0.74
someone
0.71
reinforcements
0.70
2024
0.70
they
0.70
sunrise
0.69
somebody
0.68
Activations Density 1.125%