INDEX
Explanations
expressions indicating awareness and understanding of situations
Negative Logits
Monfieur          -0.98
                  -0.86
AddAttribute      -0.82
dAtA              -0.79
Shakspeare        -0.79
CompleteListener  -0.78
Pokies            -0.76
XNUMX             -0.75
poffible          -0.71
pleaſure          -0.71
Positive Logits
know              1.15
knows             1.12
know              1.05
Know              0.92
Know              0.92
knew              0.90
understands       0.89
knows             0.88
recognize         0.88
understand        0.87
Activations Density 0.281%
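
The density figure above is reported without a definition. A minimal sketch of one common reading, assuming it means the fraction of token positions on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and not taken from the dashboard:

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" is the share of positions where the feature fires at all.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 100,000 token positions, 281 of which activate the feature.
sample = [0.0] * 99_719 + [1.5] * 281
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.281%
```

Under this reading, a density of 0.281% would mean the feature is active on roughly 3 of every 1,000 tokens in the sampled text.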