INDEX
Explanations
entities related to different types of professions or roles within specific contexts
terms related to various groups of people affected by social or economic issues
New Auto-Interp
Negative Logits
charm
-0.66
fiasco
-0.65
indictment
-0.62
rumors
-0.61
rumours
-0.60
bernatorial
-0.60
rumor
-0.59
wonders
-0.56
plea
-0.56
sunshine
-0.55
POSITIVE LOGITS
alike
1.06
'
1.06
themselves
0.99
'.
0.96
who
0.94
pace
0.93
hip
0.92
folk
0.90
paces
0.88
irrespective
0.86
Activations Density 0.374%