Explanations
References to the word "Barb" and its variations, indicating a focus on a specific entity or concept related to that term.
Negative Logits
VOL          -0.79
EVA          -0.76
lihood       -0.75
train        -0.74
nder         -0.73
CRIP         -0.72
EngineDebug  -0.71
方           -0.68
VEN          -0.67
DER          -0.67
Positive Logits
ados    1.29
izon    1.18
adian   1.10
adoes   1.06
assian  0.99
ican    0.99
arella  0.97
uda     0.96
attery  0.95
abies   0.95
Activations Density 0.003%