INDEX
Explanations
dates in the month of February
dates, particularly in February
New Auto-Interp
Negative Logits
behavi
-0.65
cumbers
-0.57
opic
-0.57
elephant
-0.57
constitu
-0.56
rawdownloadcloneembedreportprint
-0.56
predec
-0.55
Palestin
-0.55
adm
-0.55
sacrific
-0.54
POSITIVE LOGITS
nd
1.25
uary
0.90
ruary
0.89
28
0.87
2019
0.86
Madness
0.86
iven
0.86
26
0.85
27
0.84
23
0.81
Activations Density 0.037%