INDEX
Explanations
the presence of the word "burger" or its variations
Burger and user input
New Auto-Interp
Negative Logits
|)
-0.40
Vorlage
-0.39
)|
-0.39
|\
-0.36
已
-0.35
Faith
-0.34
]|
-0.33
besch
-0.33
},[])
-0.32
)
-0.31
POSITIVE LOGITS
Burger
2.64
Burger
2.64
burger
2.44
burger
2.27
Burgers
1.84
burgers
1.76
burgers
1.75
eburger
1.59
hamburger
1.34
urger
1.30
Activations Density 0.002%