INDEX
Explanations
sentences that emphasize unity and common ground among individuals, often using the context of discussing differences in background, beliefs, or preferences
New Auto-Interp
Negative Logits
umbnail
-0.68
ociate
-0.65
luaj
-0.64
iling
-0.63
76561
-0.63
ertodd
-0.62
izons
-0.59
rongh
-0.59
ukong
-0.59
nel
-0.58
POSITIVE LOGITS
raining
1.27
unclear
1.26
impossible
1.19
imperative
1.14
easier
1.11
ironic
1.07
doubtful
1.05
advisable
1.05
easy
1.04
conceivable
1.03
Activations Density 3.005%