INDEX
Explanations
when followed by you, I, we, it
New Auto-Interp
Negative Logits
لقد
0.39
щоб
0.38
是否
0.37
}^{[0.37
nascita
0.37
them
0.36
এক্ষেত্রে
0.36
bahawa
0.36
że
0.36
willingness
0.35
POSITIVE LOGITS
soever
0.74
confronted
0.73
जेव्हा
0.66
faced
0.60
considering
0.59
we
0.59
encountering
0.57
they
0.54
asked
0.54
ce
0.52
Activations Density 0.057%