Explanations
positive evaluations of people or situations
Negative Logits
caucus            -0.84
hadiran           -0.84
Serif             -0.80
osoba             -0.79
Divina            -0.77
lern              -0.77
Caucus            -0.77
InjectAttribute   -0.76
declarat          -0.76
रण                -0.75
Positive Logits
good              1.60
GOOD              1.57
good              1.56
GOOD              1.56
Good              1.55
Good              1.51
Goodwin           1.24
Goodman           1.14
goods             0.98
buena             0.95
Activations Density 0.060%