Explanations
sentences that express the conclusion or summary of content
Negative Logits
tampoco    -0.66
何より    -0.56
nemmeno    -0.53
ezért    -0.53
Поэтому    -0.53
therefore    -0.52
derfor    -0.51
therefore    -0.51
kimse    -0.51
难怪    -0.50
Positive Logits
Allí    1.00
darin    0.94
Briefly    0.92
therein    0.91
wherein    0.90
在这个    0.83
Titled    0.80
فيه    0.79
ніципалі    0.79
इसमें    0.78
Activations Density: 0.603%