INDEX
Explanations
phrases contrasting two different perspectives or situations
the phrase "on the other hand."
New Auto-Interp
Negative Logits
=-=-=-=-=-=-=-=-
-0.70
Balkans
-0.64
onomy
-0.63
labs
-0.62
ted
-0.61
atro
-0.61
anus
-0.61
=-=-
-0.60
burgh
-0.60
otine
-0.58
POSITIVE LOGITS
SPONSORED
0.78
nings
0.76
âĶĢâĶĢ
0.70
oths
0.65
suppose
0.65
è£ħ
0.64
essage
0.63
invoke
0.63
ãĢģ
0.60
pherd
0.59
Activations Density 0.022%