INDEX
Explanations
positive comments and affirmations
sentences that express strong emotional sentiments or impactful statements
Negative Logits
arching      -0.74
Afgh         -0.73
*/           -0.70
*/           -0.70
arisen       -0.69
conver       -0.65
endi         -0.65
ĵĺ           -0.64
frequency    -0.63
etheless     -0.63
Positive Logits
quotes       0.86
Said         0.85
yrics        0.80
"...         0.78
omsky        0.75
reply        0.75
referring    0.70
quotation    0.70
replied      0.69
upon         0.69
Activations Density: 0.518%