INDEX
Explanations
phrases indicating a high degree of emphasis or importance
the expression of emphasis in statements
Negative Logits
neapolis    -0.77
rones       -0.73
GOODMAN     -0.71
venient     -0.67
æ°          -0.66
itures      -0.66
Many        -0.66
ERY         -0.65
OA          -0.65
ourse       -0.65
Positive Logits
appreciated  1.16
alive        1.03
intact       0.87
alike        0.86
reliant      0.85
deserved     0.82
dependent    0.81
depended     0.80
regarded     0.79
resembled    0.79
Activations Density 0.046%
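
To put the density figure in concrete terms, here is a minimal sketch. It assumes "activations density" means the fraction of sampled tokens on which the feature's activation is nonzero. The dashboard does not state this definition, so treat it as an assumption. The function name `activation_density` and the sample data are hypothetical and chosen only to reproduce the 0.046% figure.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density = (tokens where the feature fires) / (all sampled tokens).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: the feature fires on 46 of 100,000 tokens.
sample = [1.5] * 46 + [0.0] * 99_954
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.046%
```

Under this reading, a density of 0.046% corresponds to the feature firing on roughly 46 out of every 100,000 tokens.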