INDEX
Explanations
phrases related to rebukes or sharp criticisms
occurrences of the word "rebuff" and its variations indicating resistance or rejection
New Auto-Interp
Negative Logits
lihood
-0.90
Kafka
-0.77
âĢ¢âĢ¢
-0.76
Grail
-0.66
Flow
-0.66
Dome
-0.65
Valhalla
-0.64
meal
-0.63
Keeper
-0.62
teness
-0.61
POSITIVE LOGITS
anche
1.05
ounding
1.04
uffed
0.98
reb
0.96
ounded
0.92
uked
0.91
uls
0.91
aul
0.90
utations
0.89
becca
0.87
Activations Density 0.004%