INDEX
Explanations
instances of the word "speak" and its variations, indicating discussion or communication
Negative Logits
原始内容存档于    -0.40
faithful          -0.38
InputBorder       -0.38
insegna           -0.38
oma               -0.34
łoż               -0.33
Hammer            -0.32
CHR               -0.32
Hvordan           -0.32
gering            -0.32
Positive Logits
volumes           0.79
truth             0.78
fluent            0.73
highly            0.73
Highly            0.66
Highly            0.66
louder            0.65
Volumes           0.64
highly            0.63
CURIAM            0.63
Activations Density: 0.151%