Explanations
dates in the format year-month-day with strong activations
dates and timestamps
Negative Logits
avorite       -0.76
ometimes      -0.70
ilitary       -0.67
intercepted   -0.66
afety         -0.61
Rivals        -0.61
onite         -0.61
arming        -0.60
ña            -0.58
Illum         -0.58
Positive Logits
02            0.79
actionDate    0.77
tnc           0.75
partName      0.75
01            0.73
displayText   0.73
08            0.70
09            0.69
rip           0.68
04            0.68
Activations Density: 0.049%