INDEX
Explanations
phrases or clauses starting with "anything that" or "everything that"
phrases that begin with "that" indicating specificity or clarification
New Auto-Interp
Negative Logits
mi
-0.70
aq
-0.70
ply
-0.67
roth
-0.65
gur
-0.65
pling
-0.62
lying
-0.60
ãĥ¥
-0.60
united
-0.60
roy
-0.59
POSITIVE LOGITS
happens
1.17
happened
1.09
occurs
0.99
arose
0.97
transpired
0.93
happen
0.92
affects
0.91
entails
0.91
happ
0.91
separates
0.90
Activations Density 0.162%