INDEX
Explanations
mentions of specific names and titles
mentions of specific individuals and their significance in a context
New Auto-Interp
Negative Logits
redes
-0.67
cryptocurrency
-0.63
DATA
-0.60
watchdog
-0.59
homeowner
-0.58
HuffPost
-0.58
uncover
-0.57
prompts
-0.56
autonom
-0.55
urges
-0.55
POSITIVE LOGITS
blah
1.02
â̦"
0.96
..."
0.95
['
0.95
everybody
0.87
anybody
0.87
["
0.87
yeah
0.86
somebody
0.83
Godd
0.80
Activations Density 0.556%