INDEX
Explanations
phrases expressing comparison and importance
phrases indicating negative consequences or concerns regarding youth
New Auto-Interp
Negative Logits
DNA
-0.63
iste
-0.61
SHIP
-0.61
dinand
-0.60
ngth
-0.59
undy
-0.58
release
-0.56
eln
-0.56
"],
-0.56
acement
-0.56
POSITIVE LOGITS
importantly
1.05
interestingly
0.84
guessed
0.79
furthermore
0.79
Ħ¢
0.75
aside
0.75
incidentally
0.73
versely
0.72
moreover
0.70
meantime
0.70
Activations Density 0.837%