INDEX
Explanations
pronominal and possessive pronoun pairings followed by a verb
phrases indicating subjective opinions and expressions
New Auto-Interp
Negative Logits
[-
-0.71
andise
-0.70
Daly
-0.67
ãĤ½
-0.66
ume
-0.64
Mass
-0.62
clearance
-0.62
Clicker
-0.62
ãĢIJ
-0.61
BB
-0.61
POSITIVE LOGITS
rosso
0.82
somew
0.79
someday
0.78
misunder
0.73
wiser
0.70
cheat
0.70
rha
0.70
awaru
0.69
inadvert
0.67
swayed
0.65
Activations Density 0.198%