INDEX
Explanations
phrases indicating acknowledgment or consideration
New Auto-Interp
Negative Logits
Tale
-0.67
tatt
-0.66
Pengu
-0.64
suspic
-0.63
kan
-0.62
Yose
-0.61
Courage
-0.60
icht
-0.59
è£
-0.59
snipp
-0.59
POSITIVE LOGITS
regards
0.95
regard
0.74
aning
0.71
reference
0.70
equality
0.70
permissions
0.69
forward
0.69
thereto
0.68
ranging
0.68
asers
0.67
Activations Density 0.013%