INDEX
Explanations
the word "Knight"
repeated mentions of the name "Knight."
Negative Logits
mble       -0.78
ional      -0.74
andem      -0.73
ience      -0.73
ascus      -0.73
iversal    -0.73
imester    -0.72
etsk       -0.72
ential     -0.70
obos       -0.68
Positive Logits
Templar    1.23
mare       1.22
Knight     1.16
holder     1.06
mares      1.06
Riders     0.95
Knights    0.94
Rider      0.93
fish       0.91
Templ      0.89
Activations Density 0.011%