INDEX
Explanations
expressions of personal opinions and perspectives
New Auto-Interp
Negative Logits
Xuân
-0.61
⚭
-0.59
cherry
-0.57
inator
-0.57
confirmación
-0.56
Laughs
-0.56
```
-0.56
UFACT
-0.55
Corpor
-0.55
hidupan
-0.55
POSITIVE LOGITS
believes
0.89
believed
0.88
thinks
0.87
believe
0.82
think
0.81
believe
0.74
تانيه
0.73
think
0.72
Opinion
0.71
belie
0.71
Activations Density 0.404%