INDEX
Explanations
positive or enthusiastic adjectives and phrases
expressions of strong positive sentiment or approval
New Auto-Interp
Negative Logits
Garcia
-0.53
fronts
-0.52
Marvin
-0.50
Greenwich
-0.49
unin
-0.49
Nicarag
-0.48
Kaufman
-0.48
Pentagon
-0.48
Shelby
-0.48
Melvin
-0.47
POSITIVE LOGITS
âĢ
1.45
âĢ
1.20
âĢº
1.17
[/
1.14
âľ
1.10
»
1.08
¨
1.08
ðŁij
1.05
ðŁ
1.04
âĹ
1.04
Activations Density 0.531%