INDEX
Explanations
statements indicating personal opinion or intention
Negative Logits
Token                              Logit
Hebdo                              -0.47
Reviewer                           -0.46
taboola                            -0.46
rab                                -0.46
Scores                             -0.43
itles                              -0.43
endez                              -0.43
scree                              -0.42
lation                             -0.41
rawdownloadcloneembedreportprint   -0.41
Positive Logits
Token                              Logit
myself                             0.67
é¾įå                               0.58
ishi                               0.56
ĸļ                                 0.54
personally                         0.53
cious                              0.52
¿½                                 0.50
na                                 0.50
igo                                0.49
fortunate                          0.48
Activations Density: 5.337%