INDEX
Explanations
phrases related to comparisons or options
phrases that introduce subsequent discussions or topics
New Auto-Interp
Negative Logits
oris
-0.78
vier
-0.75
sav
-0.74
contract
-0.69
cos
-0.68
imon
-0.67
ile
-0.66
estones
-0.66
nor
-0.66
orate
-0.65
POSITIVE LOGITS
raining
0.76
roaring
0.68
undone
0.67
flooding
0.66
pouring
0.66
closer
0.64
tro
0.61
comparing
0.61
tempting
0.59
toget
0.59
Activations Density 0.036%