Negative Logits
Consent     0.52
assent      0.45
Messaging   0.43
Martha      0.42
Mechanical  0.41
Ass         0.39
            0.39
Quadr       0.39
MECHAN      0.39
Anne        0.38
Positive Logits
out         0.73
out         0.68
stake       0.57
%>          0.53
%><%=       0.52
response    0.51
%>          0.50
Out         0.49
आउट         0.49
stake       0.47
Activations Density 0.003%