INDEX
Explanations
phrases indicating a strong emphasis or assertion
the word "that" in various contexts
Negative Logits
WARD        -0.67
emis        -0.64
Veter       -0.60
Prepar      -0.60
PRESS       -0.59
Acknowled   -0.59
Targ        -0.59
Tax         -0.58
Luck        -0.58
èĪ          -0.58
Positive Logits
cher        0.91
chers       0.85
ched        0.80
ching       0.79
fateful     0.75
lasted      0.73
damned      0.71
pesky       0.71
ÅĤ          0.66
fundament   0.63
Activations Density 0.124%