INDEX
Explanations
themes related to identity and social issues
Negative Logits
To         -0.15
since      -0.14
Of         -0.14
-To        -0.14
जबक        -0.14
whereas    -0.13
although   -0.13
-Man       -0.13
ToPoint    -0.13
And        -0.13
Positive Logits
Your       0.38
Those      0.35
Their      0.35
These      0.35
Each       0.34
Our        0.32
Some       0.31
Your       0.30
Various    0.30
Several    0.28
Activations Density 0.268%