INDEX
Explanations
statements expressing agreement or disagreement regarding beliefs and opinions, particularly in social contexts
New Auto-Interp
Negative Logits
utie
-0.47
entials
-0.45
//
-0.45
foy
-0.45
MLLoader
-0.44
PMailer
-0.44
älde
-0.43
ptime
-0.43
#!/
-0.42
-0.41
POSITIVE LOGITS
agree
3.10
agrees
2.72
Agree
2.60
agreeing
2.59
agreement
2.55
agreed
2.55
agree
2.51
Agree
2.44
agreed
2.26
Agreed
2.24
Activations Density 0.410%