INDEX
Explanations
expressions of personal opinions
Negative Logits
Token      Logit
ſelf       -0.59
purpoſe    -0.53
ſch        -0.52
faſt       -0.51
auffi      -0.51
ſind       -0.50
Intern     -0.50
houſe      -0.50
ſever      -0.50
himſelf    -0.49
Positive Logits
Token      Logit
opinion    1.59
Opinion    1.41
opinions   1.35
Opinion    1.32
opinion    1.30
Opinions   1.27
OPINION    1.24
opinión    1.23
Opinions   1.14
opinião    1.11
Activations Density: 0.371%