Explanations
concepts that arise or reside
Negative Logits
Token       Logit
whereas     0.50
Basically   0.43
అయితే       0.43  (Telugu: "however")
instead     0.41
Although    0.41
basically   0.41
沒有        0.41  (Chinese: "there is no")
although    0.40
Unable      0.40
Whereas     0.40
Positive Logits
Token       Logit
comes       1.04
arises      0.99
lies        0.93
emerges     0.89
comes       0.87
lur         0.86
resides     0.84
rests       0.83
Comes       0.81
возникает   0.77  (Russian: "arises")
Activations Density 0.013%