INDEX
Explanations
a combination of some English words seemingly related to political subjects or personal names, and some non-English words or abbreviations
New Auto-Interp
Negative Logits
Martin
-0.53
Martin
-0.52
rec
-0.52
Rec
-0.49
©️
-0.47
rec
-0.46
Wikimedijinoj
-0.44
martin
-0.43
juice
-0.42
rek
-0.42
POSITIVE LOGITS
thasone
0.72
webElementXpaths
0.71
localctx
0.69
ویکیپدیا
0.65
__':
0.65
uests
0.63
__":
0.63
aronder
0.62
principalTable
0.62
*/),
0.62
Activations Density 16.764%