INDEX
Explanations
Attends to tokens that signal a change or indication, from tokens that suggest listing or detailing.
Head Attr Weights
0:0.06
1:0.07
2:0.06
3:0.11
4:0.09
5:0.03
6:0.43
7:0.12
Negative Logits
TabIndex      -0.44
Monfieur      -0.34
Diſ           -0.34
TÉCN          -0.33
Spons         -0.33
              -0.33
Efq           -0.33
comuniques    -0.32
دانشنامهٔ      -0.32
Jefus         -0.32
Positive Logits
);*/          0.36
();*/         0.35
}*/           0.34
*/            0.33
oprot         0.31
])));         0.31
}";           0.31
*/}           0.30
saraba        0.30
****/         0.30
Activations Density 1.512%