INDEX
Explanations
instances of the name "Colin" in the text
Negative Logits
visor                -0.76
rd                   -0.73
Skydragon            -0.72
PE                   -0.71
Wars                 -0.69
ancial               -0.68
visors               -0.67
é¾įåĸļ士              -0.67
ECT                  -0.67
natureconservancy    -0.66
Positive Logits
onel                 1.16
ossal                0.92
ossus                0.88
uations              0.83
iosis                0.83
onial                0.82
uation               0.81
ateral               0.81
oqu                  0.80
oured                0.80
Activations Density: 1.410%