INDEX
Explanations
instances where a word is emphasized or highlighted in a text
quotes or phrases that convey strong opinions or sentiments
Negative Logits
Token    Logit
Ͻ        -0.84
İĭ       -0.81
Ń·       -0.80
bris     -0.68
tein     -0.67
ĻĤ       -0.66
isl      -0.66
nas      -0.66
²¾       -0.65
»Ĵ       -0.65
Positive Logits
Token         Logit
Journal       0.68
translates    0.68
Greenpeace    0.67
/"            0.66
reads         0.65
++            0.65
refers        0.65
GOODMAN       0.61
Blog          0.60
trailed       0.58
Activations Density: 0.146%