INDEX
Explanations
terms related to naming or renaming entities
New Auto-Interp
Negative Logits
Efq
-0.95
للمعارف
-0.87
myſelf
-0.79
Theſe
-0.77
contextLoads
-0.76
Majefty
-0.75
ſta
-0.74
iſt
-0.74
Anſ
-0.74
purpoſe
-0.74
POSITIVE LOGITS
name
1.77
names
1.57
Name
1.47
name
1.47
NAME
1.38
Names
1.32
Name
1.30
名字
1.27
named
1.26
Namen
1.26
Activations Density 0.250%