INDEX
Explanations
topics or categories mentioned in a text or document
topic headers and keywords within a document
Negative Logits
ĸļ          -0.77
¯           -0.77
IELD        -0.74
subsequ     -0.73
erred       -0.71
rites       -0.69
athing      -0.69
ornia       -0.68
rier        -0.67
ards        -0.67
Positive Logits
Mandatory   0.66
Become      0.64
Various     0.61
?]          0.59
POLIT       0.59
Provided    0.57
Cosponsors  0.57
econom      0.56
govtrack    0.56
education   0.56
Activations Density: 0.035%