INDEX
Explanations
phrases starting with 'We' or similar pronouns
pronouns indicating personal or collective perspectives
Negative Logits
Leilan      -0.64
ãĥ¼ãĥĨ      -0.61
goto        -0.53
outwe       -0.51
Constantin  -0.50
cradle      -0.50
ioxide      -0.49
Reef        -0.46
reef        -0.46
sylv        -0.46
Positive Logits
jriwal      0.75
estine      0.62
sect        0.58
vernment    0.58
Dat         0.58
anmar       0.57
Bus         0.57
anyahu      0.56
NetMessage  0.55
agar        0.55
Activations Density 0.228%