INDEX
Explanations
mentions of the word "school"
mentions of educational institutions and their related contexts
New Auto-Interp
Negative Logits
lihood
-0.69
tame
-0.68
timestamp
-0.66
\\\\
-0.64
vernment
-0.64
sheer
-0.62
dormant
-0.61
\":
-0.60
theless
-0.59
arial
-0.59
POSITIVE LOGITS
girls
1.10
children
1.07
kids
0.98
masters
0.97
master
0.95
©¶æ
0.94
neys
0.88
Students
0.88
boys
0.85
teachers
0.84
Activations Density 0.031%