INDEX
Explanations
phrases that include the expression "in short" or similar phrases that summarize a concept
phrases that emphasize a point or provide restatement
New Auto-Interp
Negative Logits
ãĥ¥
-0.71
Äĩ
-0.70
alysed
-0.68
aq
-0.67
col
-0.64
Rated
-0.63
REAM
-0.61
gi
-0.61
rals
-0.61
igators
-0.60
POSITIVE LOGITS
yss
0.88
unless
0.75
beware
0.70
imagine
0.68
yeah
0.65
congratulations
0.65
ymm
0.64
Whoever
0.64
whoever
0.63
barring
0.63
Activations Density 0.082%