INDEX
Explanations
patterns consisting of repeated characters or symbols, indicative of certain formatting or structure in the text
New Auto-Interp
Negative Logits
دانشنامهٔ
-0.80
]}$
-0.78
PREDICT
-0.77
✨:
-0.76
AutoField
-0.76
closedir
-0.75
ReusableCell
-0.75
wieś
-0.74
שוליים
-0.74
énario
-0.71
POSITIVE LOGITS
----------------
1.87
---------------
1.12
------------
1.05
--------------
1.02
-------------
1.01
--------
0.96
-----------
0.95
---------
0.91
------
0.90
-------
0.85
Activations Density 0.268%