INDEX
Explanations
references to early time periods or beginnings
New Auto-Interp
Negative Logits
utenberg
-0.80
edIn
-0.77
ICLE
-0.72
Pwr
-0.69
ikk
-0.66
atism
-0.64
roma
-0.64
ittee
-0.63
lde
-0.63
alach
-0.62
POSITIVE LOGITS
mornings
0.99
morning
0.99
afternoon
0.98
twenties
0.91
evening
0.86
adop
0.85
stages
0.84
enough
0.81
versions
0.80
childhood
0.78
Activations Density 2.766%