INDEX
Explanations
adjectives describing opinions or evaluations of experience
adjectives and adverbs that convey positive and negative evaluations
New Auto-Interp
Negative Logits
ãĥĺãĥ©
-0.71
asio
-0.66
adj
-0.66
Abstract
-0.66
uce
-0.60
Domain
-0.60
ãĥ¯ãĥ³
-0.59
bert
-0.59
uction
-0.59
################################
-0.58
POSITIVE LOGITS
lately
1.18
fruitful
0.99
since
0.99
unsuccessful
0.88
successful
0.82
steady
0.81
dogged
0.77
steadily
0.76
consistent
0.74
productive
0.72
Activations Density 0.268%