INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    θο
    -0.08
    	fi
    -0.08
    gol
    -0.08
    Iyo
    -0.08
     gefragt
    -0.08
     snabb
    -0.07
    gev
    -0.07
     Olsen
    -0.07
     aufge
    -0.07
    	file
    -0.07
    POSITIVE LOGITS
     intermediate
    0.08
     fur
    0.07
     cavities
    0.07
     auxiliary
    0.07
     analis
    0.07
     знамен
    0.07
     antim
    0.07
     güz
    0.07
     провод
    0.07
     principle
    0.07
    Act Density 0.009%

    No Known Activations