INDEX
    Explanations

    phrases indicating causation or consequences

    New Auto-Interp
    Negative Logits
    `{.
    -0.69
    -0.68
    >");
    
    -0.65
    redients
    -0.64
    "){
    
    -0.64
    Hop
    -0.63
    "));
    
    -0.62
    '):
    
    -0.61
    igenous
    -0.61
    >")
    -0.60
    POSITIVE LOGITS
     result
    0.79
    脚注の使い方
    0.79
     infolge
    0.78
     результате
    0.77
    
    0.70
     Folge
    0.69
     obstante
    0.69
    result
    0.67
     للاسماء
    0.67
     تضيفلها
    0.67
    Act Density 0.072%

    No Known Activations