INDEX
Explanations
discussions related to sentences and sentence structure
New Auto-Interp
Negative Logits
fleste
-0.72
Warburton
-0.69
icoot
-0.68
aussieht
-0.67
alamus
-0.66
dolu
-0.65
لاثة
-0.64
eryllium
-0.62
nicio
-0.61
rinfo
-0.61
POSITIVE LOGITS
sentences
1.54
sentence
1.49
Sentence
1.44
sentences
1.28
Sentences
1.26
Sentence
1.25
sentence
1.21
frase
0.99
paragraph
0.90
phrase
0.89
Activations Density 0.140%