INDEX
    Explanations

    formal terms related to policies and their implications

    New Auto-Interp
    Negative Logits
    ifrance
    -0.67
     ostavi
    -0.65
    <!--[
    -0.65
     дописавши
    -0.63
     Wikidata
    -0.58
    .
    -0.56
    .(*
    -0.55
    Földrajzportál
    -0.53
    enderror
    -0.53
    -0.51
    POSITIVE LOGITS
     of
    1.02
    ของ
    0.85
     των
    0.68
     của
    0.67
     της
    0.64
    сяг
    0.62
    følgelig
    0.61
     strøm
    0.60
    имость
    0.60
     متعلقه
    0.60
    Act Density 1.053%

    No Known Activations