INDEX
Explanations
phrases related to control, authority, and conflict
conjunctions and linking terms that connect ideas and clauses in the text
New Auto-Interp
Negative Logits
appro
-0.73
ĸ
-0.64
iren
-0.59
emn
-0.58
surreal
-0.58
©¶æ
-0.58
endorsement
-0.57
Aid
-0.56
flagship
-0.56
amic
-0.56
POSITIVE LOGITS
selves
0.75
vous
0.72
Templ
0.71
Bullets
0.67
azy
0.66
itsch
0.63
mates
0.62
arent
0.60
inki
0.59
iblings
0.59
Activations Density 0.574%