INDEX
Explanations
phrases indicating personal intentions or goals
phrases that express intentions, goals, or responses
New Auto-Interp
Negative Logits
themselves
-0.74
idates
-0.68
ierrez
-0.63
yourselves
-0.63
ikhail
-0.62
endez
-0.61
-+-+
-0.60
ãħĭ
-0.58
ãĤª
-0.58
Leone
-0.57
POSITIVE LOGITS
colleague
0.76
husband
0.74
favorite
0.73
ventures
0.72
thesis
0.68
stic
0.68
myself
0.67
planner
0.67
collection
0.66
arest
0.66
Activations Density 0.252%