INDEX
Explanations
relationships and contrasts between concepts or entities
New Auto-Interp
Negative Logits
sær
-0.53
enumi
-0.49
不止
-0.48
以外にも
-0.47
olm
-0.46
nejen
-0.46
antwo
-0.45
verschiedener
-0.45
Apakah
-0.44
%=
-0.43
POSITIVE LOGITS
Conversely
1.87
conversely
1.85
Conversely
1.78
meanwhile
1.62
Whereas
1.53
Whereas
1.52
Meanwhile
1.51
whereas
1.50
Meanwhile
1.48
whereas
1.47
Activations Density 0.398%