INDEX
Explanations
phrases related to instructions or announcements, especially related to events or organizations
terms related to applications or formal procedures
New Auto-Interp
Negative Logits
jack
-0.75
comb
-0.65
rock
-0.65
Hor
-0.64
Junior
-0.63
far
-0.63
meet
-0.61
grade
-0.60
confid
-0.60
win
-0.59
POSITIVE LOGITS
ATION
3.95
ATIONS
3.38
ATED
2.54
ATOR
2.28
ATING
2.22
ations
2.15
ATIONAL
2.11
ATIVE
2.10
ation
1.94
ATES
1.93
Activations Density 0.010%