Explanations
References to specific individuals, particularly those named "Kat" or variants of that name.
Negative Logits
Token     Logit
ollo      -0.15
Lac       -0.14
iciary    -0.14
eca       -0.14
sac       -0.14
ICI       -0.14
539       -0.14
ắt        -0.14
["_       -0.14
847       -0.13
Positive Logits
Token     Logit
owitz     0.19
rink      0.17
ziej      0.17
vice      0.15
Flash     0.14
ainen     0.14
ä»ĺ       0.14
妮        0.14
apolis    0.14
ار        0.14
Activations Density: 0.032%
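The page does not say how the density figure is computed. A common reading, assumed here, is the share of sampled token positions on which the feature's activation is above zero. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and do not come from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means (number of active positions) / (total positions).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, of which 3 are active.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)

# Format as a percentage, in the same style as the figure above.
print(f"{density:.3%}")
```

Under this reading, a density of 0.032% would mean the feature is active on roughly 3 of every 10,000 sampled tokens, which fits a feature tied to a narrow set of names.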