INDEX
Explanations
phrases that are emphasized or highlighted within the text
expressions of hopes and fears related to personal and familial situations
New Auto-Interp
Negative Logits
½
-0.74
purs
-0.72
argon
-0.72
cius
-0.68
glers
-0.66
zin
-0.65
Candidate
-0.65
oga
-0.64
agon
-0.63
obar
-0.62
POSITIVE LOGITS
Scroll
1.03
SPONSORED
0.86
Speaking
0.80
Actor
0.78
However
0.77
Shape
0.74
Yesterday
0.74
Scotland
0.73
Hundreds
0.72
Prof
0.72
Activations Density 0.446%