INDEX
Explanations
recognize, acknowledge, embrace
New Auto-Interp
Negative Logits
obtains
0.46
obtain
0.43
computes
0.42
obtient
0.42
spawning
0.41
complies
0.41
produces
0.40
получи
0.40
produce
0.40
compliance
0.39
POSITIVE LOGITS
Recogn
0.88
Recognizing
0.84
Recognize
0.83
embrace
0.82
embracing
0.82
recognizing
0.80
recognize
0.80
Embrace
0.79
acknowledge
0.78
осозна
0.75
Activations Density 0.030%