INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
socalled
0.78
نکن
0.70
isieren
0.66
longterm
0.66
﹗
0.65
کنیم
0.63
हेलो
0.62
======
0.61
thawed
0.60
सीना
0.60
POSITIVE LOGITS
1.17
1.08
1.05
1.01
0.97
0.94
................
0.88
0.86
………………………………
0.84
0.83
Activations Density 0.004%