INDEX
Explanations
phrases indicating a decision or responsibility assigned to someone
occurrences of the phrase "up to," suggesting a focus on responsibility or decision-making
Negative Logits
VICE            -0.60
Flavoring       -0.59
understatement  -0.59
thia            -0.58
diseng          -0.55
Untitled        -0.55
aves            -0.54
Frag            -0.54
Liter           -0.54
isha            -0.52
Positive Logits
ended           1.03
vote            0.91
regulated       0.91
votes           0.90
swing           0.89
bra             0.89
sell            0.88
graded          0.87
rated           0.87
stairs          0.86
Activations Density 0.038%
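
The density figure can be read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading only; it assumes "active" means an activation strictly above zero, and the function name, the threshold parameter, and the sample counts are hypothetical rather than taken from the dashboard.

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (0.0 by default).
    """
    if len(activations) == 0:
        raise ValueError("activations must not be empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 38 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 38 + [0.0] * (100_000 - 38)
density = activation_density(sample)
density_label = f"{density:.3%}"
```

With these made-up counts, `density_label` formats as a percentage with three decimals, the same shape as the figure reported above. A density this low means the feature is sparse: it is silent on the vast majority of tokens.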