INDEX
Explanations
proper nouns and names
Tokens after a single letter name
names of people and places
New Auto-Interp
Negative Logits
Theſe
-0.52
Beſ
-0.51
ECAUSE
-0.50
Longo
-0.50
zke
-0.49
neutre
-0.46
Reſ
-0.46
leaſt
-0.46
marvin
-0.46
Houſe
-0.45
POSITIVE LOGITS
AssemblyProduct
0.70
BeginContext
0.68
انيف
0.67
Signalez
0.66
mouseY
0.66
AndEndTag
0.66
tagHelperRunner
0.63
חיצוניים
0.63
@[
0.61
IsMutable
0.61
Activations Density 0.326%