INDEX
Explanations
dates in month-year format, with high activations on day numbers in some cases
numerical values associated with dates and times
Negative Logits
gall       -0.70
ihad       -0.69
Appl       -0.64
ËĪ         -0.62
oun        -0.62
aroused    -0.60
ó          -0.60
âĸ¬        -0.60
anat       -0.60
uits       -0.60
Positive Logits
ा          0.72
bis        0.68
PAC        0.61
intendent  0.61
åħī        0.61
Coach      0.59
backer     0.56
asp        0.56
503        0.56
apest      0.55
Activations Density 0.191%