INDEX
Explanations
references to influential positions or prominent entities
occurrences of the word "vy" or related variants
Negative Logits
Reviewed    -0.78
emate       -0.75
eared       -0.68
emark       -0.67
hetic       -0.66
aphael      -0.66
yrinth      -0.65
rador       -0.65
Contents    -0.64
Haunted     -0.63
Positive Logits
vy          1.17
puff        0.85
yy          0.85
anka        0.83
eus         0.80
y           0.75
bags        0.73
hawks       0.69
plets       0.69
reys        0.69
Activations Density 0.025%
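
To put the density figure in perspective, here is a minimal sketch that converts it into an approximate activation rate. It assumes that "Activations Density" means the percentage of tokens on which the feature has a nonzero activation; the page does not state this, and the helper name `density_to_rate` is hypothetical.

```python
def density_to_rate(density_percent: float) -> tuple[float, float]:
    """Convert a density given in percent to (fraction, tokens per activation).

    Assumption: density is the share of tokens with a nonzero activation.
    """
    fraction = density_percent / 100.0      # percent -> fraction of tokens
    tokens_per_activation = 1.0 / fraction  # average spacing between activations
    return fraction, tokens_per_activation


fraction, tokens_per_activation = density_to_rate(0.025)
print(f"fraction of tokens: {fraction:.5f}")
print(f"about 1 activation per {round(tokens_per_activation)} tokens")
```

Under that assumption, a density of 0.025% corresponds to the feature firing on roughly one token in every 4,000, which fits a feature tied to a narrow token pattern such as "vy".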