INDEX
Explanations
words related to financial matters, such as earnings and investments
words related to earnings or financial gains
New Auto-Interp
Negative Logits
naires
-0.75
Butterfly
-0.66
ktop
-0.64
Bastard
-0.62
Andromeda
-0.60
Tsuk
-0.60
moderation
-0.59
Omaha
-0.59
dots
-0.58
bis
-0.58
POSITIVE LOGITS
nings
1.26
lier
1.14
ning
1.14
ls
1.03
ns
0.99
thing
0.98
nce
0.98
cy
0.95
liest
0.93
ving
0.91
Activations Density 0.071%