Explanations
statements about personal experiences and reflections
the phrase "what it's like" or similar expressions describing subjective experiences or perspectives.
Negative Logits
apapun       -0.42
anything     -0.40
anything     -0.39
Anything     -0.36
enapa        -0.36
miento       -0.36
égard        -0.35
qualquer     -0.34
Cualquier    -0.34
mengapa      -0.34
Positive Logits
-------------</      0.58
nakalista            0.53
(token not shown)    0.52
はこんな感じ         0.52
prawdzi              0.52
ecuted               0.52
Koordinaten          0.52
wahre                0.50
فريبيس               0.50
createState          0.50
Activations Density 0.129%
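
To make the density figure concrete, here is a minimal sketch that assumes "activation density" means the fraction of sampled token positions on which the feature fires (activation above zero). That definition, the function name activation_density, the threshold parameter, and the sample data are all illustrative assumptions; none of them come from the page itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: density is counted per token position, and a position is
    "active" when its activation is strictly greater than the threshold.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 129 active positions out of 100,000 tokens.
sample = [1.0] * 129 + [0.0] * (100_000 - 129)
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.129%"
```

Under this reading, a density of 0.129% corresponds to roughly 1.3 active positions per 1,000 tokens, so the feature is sparse.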