INDEX
Explanations
expressions of personal opinions and subjective viewpoints
New Auto-Interp
Negative Logits
uzu
-0.15
lix
-0.15
λÏį
-0.14
Potential
-0.14
aspiring
-0.13
mgr
-0.13
odo
-0.13
tane
-0.13
Trem
-0.13
amburger
-0.13
POSITIVE LOGITS
impression
0.30
feeling
0.29
gut
0.24
impressions
0.24
experience
0.23
understanding
0.23
conclusion
0.21
assessment
0.21
view
0.20
feelings
0.20
Activations Density 0.228%