INDEX
Explanations
phrases indicating the author's personal opinion
expressions of personal opinions and views
New Auto-Interp
Negative Logits
ancial
-0.74
Brief
-0.65
Beaut
-0.62
bard
-0.62
uddin
-0.59
Start
-0.59
artney
-0.59
deposited
-0.58
Submit
-0.57
Starts
-0.57
POSITIVE LOGITS
Īè
0.79
·
0.77
©¶æ¥µ
0.72
naissance
0.71
§
0.70
ħĭ
0.69
©¶æ
0.69
uyomi
0.68
etheless
0.68
ĸ
0.68
Activations Density 0.073%