INDEX
Explanations
words related to seeking or requesting something, often in a formal or official context
words related to requests or demands for something, particularly in a legal or formal context
New Auto-Interp
Negative Logits
cyn
-0.73
parity
-0.64
symmetry
-0.63
lessness
-0.63
Tyr
-0.62
variance
-0.61
observers
-0.61
Worlds
-0.60
vigilance
-0.60
glances
-0.59
POSITIVE LOGITS
ated
1.22
ating
1.05
ates
1.02
ate
1.00
ats
0.96
atin
0.96
ased
0.95
ational
0.94
ATED
0.93
ieve
0.93
Activations Density 0.084%