INDEX
Explanations
phrases mentioning exclusivity or limited numbers of people
terms related to exclusivity and limited access
New Auto-Interp
Negative Logits
moil
-0.63
nowhere
-0.63
rus
-0.60
çľ
-0.59
auga
-0.58
apego
-0.58
nothing
-0.58
Translation
-0.58
DRAG
-0.57
çī
-0.57
POSITIVE LOGITS
survive
1.29
survives
1.22
ever
1.17
ever
1.15
anymore
1.11
survived
1.07
realise
1.04
bothered
1.01
dared
1.01
actually
0.99
Activations Density 0.224%