Explanations
references to living beings or life
living contexts
Negative Logits
Token          Logit
<bos>          -0.65
deleteById     -0.59
Shorts         -0.56
Autriche       -0.56
Falcons        -0.56
ếm             -0.54
Chapelle       -0.53
Falcon         -0.53
Motorcycles    -0.53
оне            -0.52
Positive Logits
Token          Logit
Living         1.11
living         1.09
living         1.05
Living         1.04
LIVING         1.00
LIVING         0.95
Livingston     0.71
Hidup          0.71
Liv            0.70
Liv            0.66
Activations Density: 0.010%
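
For context, an activation density of this kind is commonly read as the fraction of token positions on which the feature fires. The sketch below assumes that definition; the page does not state how the figure was computed. The function name, the zero threshold, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of tokens with a nonzero
    (above-threshold) activation. This is not confirmed by the source page.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 1 token out of 10,000.
acts = [0.0] * 9999 + [3.2]
density = activation_density(acts)
print(f"{density:.3%}")
```

Under this reading, a density of 0.010% corresponds to roughly one active token in every 10,000.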