INDEX
Explanations
locations or settings
sentences that describe the status or characteristics of objects or events
New Auto-Interp
Negative Logits
uld
-0.67
dies
-0.65
ue
-0.62
parties
-0.61
selves
-0.61
rities
-0.61
aths
-0.61
Spirits
-0.60
sheet
-0.58
mosp
-0.58
POSITIVE LOGITS
sorely
0.79
tremendously
0.78
extremely
0.76
extraordinarily
0.76
blat
0.75
awfully
0.75
definitely
0.75
immensely
0.74
greatly
0.74
strikingly
0.74
Activations Density 0.339%