INDEX
Explanations
repeated references to the word "the"
New Auto-Interp
Negative Logits
ensem
-0.16
inflamm
-0.15
ahoma
-0.15
.AppSettings
-0.14
rzy
-0.14
raÄį
-0.14
ıf
-0.14
صÙĨع
-0.14
soles
-0.13
asant
-0.13
POSITIVE LOGITS
standpoint
0.39
perspective
0.38
perspectives
0.28
Perspective
0.27
outset
0.27
depths
0.24
viewpoint
0.23
comfort
0.22
pers
0.22
beginning
0.22
Activations Density 0.101%