INDEX
Explanations
references to physical media, such as print, paper books, and face-to-face interactions
New Auto-Interp
Negative Logits
ſelf
-0.80
pleaſure
-0.78
greateſt
-0.77
reaſon
-0.77
ſtate
-0.76
myſelf
-0.73
Портал
-0.72
itſelf
-0.71
seamnă
-0.69
fuite
-0.68
POSITIVE LOGITS
physical
0.92
offline
0.83
physical
0.71
Physical
0.71
PHYSICAL
0.70
Physical
0.70
physically
0.69
Offline
0.69
PHYSICAL
0.67
fisik
0.66
Activations Density 0.257%