INDEX
Explanations
themes related to societal issues and hypocrisy in discussions
New Auto-Interp
Negative Logits
ذÙĦÙĥ
-0.15
agos
-0.14
oproject
-0.14
immel
-0.13
fts
-0.13
Ì£
-0.13
.Enums
-0.13
ritte
-0.13
uhe
-0.13
áºł
-0.12
POSITIVE LOGITS
these
0.72
these
0.65
THESE
0.55
è¿ĻäºĽ
0.52
These
0.50
These
0.49
today
0.44
ÑįÑĤиÑħ
0.44
tÄĽchto
0.42
ÑĨиÑħ
0.40
Activations Density 0.978%