Explanations
expressions related to knowing or understanding a situation or task
phrases regarding individuals' awareness or competence in their actions
Negative Logits
vig            -0.73
somew          -0.67
Awareness      -0.66
Seasons        -0.62
conceivable    -0.60
anytime        -0.59
awareness      -0.58
unavailable    -0.58
Vish           -0.58
mony           -0.58
Positive Logits
ribe           0.84
sbm            0.80
talking        0.76
talking        0.73
doing          0.72
signing        0.71
addle          0.70
fw             0.69
/$             0.69
UCK            0.69
0.69
Activations Density 0.078%