INDEX
Explanations
words related to positive opinions or conditions
words associated with positive evaluations and favorable conditions
New Auto-Interp
Negative Logits
agos
-0.84
hod
-0.79
ngth
-0.78
prototype
-0.78
lang
-0.76
lua
-0.76
bus
-0.76
hid
-0.75
grave
-0.74
rum
-0.74
POSITIVE LOGITS
matchups
1.11
favorable
1.08
favourable
0.97
avorable
0.96
conditions
0.89
margins
0.87
unfavorable
0.86
matchup
0.85
agre
0.85
arrangement
0.82
Activations Density 0.022%