Explanations
mentions of first-time events or accomplishments
occurrences of the word "first."
Negative Logits
itself        -0.89
Canaver       -0.82
oneself       -0.76
Guan          -0.74
rish          -0.74
halla         -0.70
Mahm          -0.70
Mous          -0.70
arth          -0.70
Committees    -0.70
Positive Logits
foray         0.98
outing        0.89
career        0.83
cousin        0.82
incarnation   0.79
comeback      0.78
birthday      0.78
counterparts  0.77
mortal        0.77
adversary     0.75
Activations Density: 0.239%