Explanations
individuals or entities described as vocal or outspoken
terms related to strong opinions or vocal expressions
Negative Logits
pty         -0.89
ovember     -0.87
uden        -0.85
ysc         -0.83
Reincarn    -0.79
ramid       -0.75
akeru       -0.69
ueller      -0.69
atch        -0.68
VEL         -0.67
Positive Logits
ness             0.93
hip              0.90
outspoken        0.89
critic           0.86
ly               0.85
streak           0.82
personalities    0.74
odox             0.72
frontrunner      0.71
exponent         0.71
Activations Density: 0.032%