INDEX
Explanations
phrases related to a unique or significant event happening for the first time
references to "first" occasions or events
New Auto-Interp
Negative Logits
maps
-0.71
Buildings
-0.66
riber
-0.63
love
-0.63
queue
-0.63
*/(
-0.63
loving
-0.62
ractor
-0.62
apons
-0.62
jon
-0.61
POSITIVE LOGITS
installment
0.93
indication
0.91
foray
0.91
casualty
0.78
attempt
0.77
hurdle
0.77
ever
0.75
omission
0.74
chance
0.74
culmination
0.74
Activations Density 0.097%