Explanations
phrases indicating advice or information
phrases expressing doubt or negation
Negative Logits
Forums        -0.70
é¾įå¥ij士      -0.65
Principles    -0.63
MpServer      -0.60
integrity     -0.60
affinity      -0.60
Graphics      -0.57
Altern        -0.56
Adin          -0.56
igor          -0.56
Positive Logits
hear           0.87
expect         0.86
yourselves     0.80
underestimate  0.79
need           0.79
realise        0.76
know           0.76
plin           0.75
swer           0.74
necessarily    0.73
Activations Density 0.174%