Explanations
phrases related to training or education
Negative Logits
bum          -0.68
pard         -0.63
Uruguay      -0.63
adra         -0.61
headlines    -0.60
extrad       -0.60
govtrack     -0.59
caps         -0.58
headline     -0.58
rants        -0.58
Positive Logits
instructors      1.03
regimen          1.01
manuals          0.99
learn            0.98
Learning         0.96
instructor       0.95
Instruct         0.93
instructional    0.92
manual           0.91
taught           0.91
Activations Density 5.551%