INDEX
Explanations
instances of the word "that" to indicate emphasis or connection in sentences
Negative Logits
Returns     -0.77
quet        -0.73
roth        -0.69
\\\\\\\\    -0.68
LY          -0.67
Ble         -0.64
Desk        -0.62
Fax         -0.61
EMBER       -0.61
ival        -0.60
Positive Logits
comprise    1.24
surround    1.12
populate    1.09
constitute  1.03
inhabit     1.00
mattered    1.00
compose     0.99
plague      0.97
are         0.93
were        0.92
Activations Density 0.080%