INDEX
Explanations
phrases indicating personal reflections or disclosures
New Auto-Interp
Negative Logits
__(/*!
-0.67
ftagPool
-0.60
Administrativna
-0.59
ebabkan
-0.56
rouvez
-0.56
]=>
-0.55
$_['
-0.55
Italijanski
-0.54
eorum
-0.54
ταν
-0.54
POSITIVE LOGITS
余談
1.07
quick
1.01
FYI
0.99
note
0.92
FYI
0.92
quick
0.87
Quick
0.86
siden
0.85
顺便
0.83
briefly
0.82
Activations Density 0.341%