INDEX
Explanations
mentions of studying or education
instances of the word "study" and its variations
New Auto-Interp
Negative Logits
ignant
-0.73
por
-0.68
trap
-0.64
cycl
-0.62
tail
-0.62
Noir
-0.61
lash
-0.60
Lann
-0.60
generic
-0.60
cade
-0.60
POSITIVE LOGITS
abroad
0.96
habits
0.75
arios
0.75
hemat
0.74
hran
0.74
courses
0.72
Journalism
0.70
umat
0.68
hammad
0.68
curric
0.68
Activations Density 0.034%