INDEX
Explanations
negations and expressions of disbelief or disappointment
Follows "no" or "<start_of_turn>"
no idiomatic phrase
Negative Logits
onAttach          -0.37
CodedInputStream  -0.36
paire             -0.33
InitVars          -0.32
pair              -0.32
bingung           -0.32
jalá              -0.31
Nationalité       -0.31
mascarilla        -0.30
Budaya            -0.30
Positive Logits
autorytatywna     0.76
rungsseite        0.62
AddTagHelper      0.61
Архівовано        0.60
GOTREF            0.60
ſehen             0.55
fashiola          0.55
UrlResolution     0.55
no                0.54
risen             0.54
Activations Density: 0.117%