Student Area

Dr. Mario – facereadings

One of the things I thought was really cool was this work by Bruno Munari:

It was meant to show how little we really needed to portray a face and it reminded me a lot of non-human cartoon characters that have weird forms but we can still tell where the face is and the emotions its showing.

 

The other thing I found interesting was the project Artifacial: https://artifacial.org/

It think its pretty weird to have an externally controlled face, and I have no idea what the uses would be, but it was cool to see the control they had on each different muscle in the face using the nodes. It must be really weird to be the guy getting his face shifted.

Koke_Cacao-facereadings

(06-FaceReadings)

Zach Lieberman, Más Que La Cara Overview (~12 minute read)
Kyle McDonald, Appropriating New Technologies: Face as Interface (~15 minute read)
Last Week Tonight: Face Recognition (21 minutes)
Joy Buolamwini: How I’m fighting bias in algorithms (9 minutes)
Nabil Hassein, Against Black Inclusion in Facial Recognition (~5 minutes)

Many issues of facial (biometric) recognition are highlighted in the above links including privacy concerns, surveillance concerns, and machine bias… I won’t reiterate them here.

Today, the combination of face detection with publicly available social network information can correctly predict your Facebook profile and the first five digits of your SSN for a third of the public, in under three seconds:
Our study is less about face recognition and more about privacy concerns raised by the convergence of various technologies. There is no obvious answer and solution to the privacy concerns raised by widely available face recognition and identified (or identifiable) facial images. Google’s Eric Schmidt observed that, in the future, young individuals may be entitled to change their names to disown youthful improprieties. It is much harder, however, to change someone’s face.

In reading one of those articles, the above sentences shocked me again although I have known this fact long ago (except for the 5 digits of the SSN part). While reading those articles, two thoughts came to my mind as a scalable way to combat facial recognition software.

  1. There are many adversarial tools for directly modifying the faces. One example is Adversarial Mask. However, those solutions are infeasible as people don’t even want to put on their masks for COVID precaution. We need a better solution.
  2. Facial Filters in Cameras: we can implement algorithms that recognize and modify detected faces slightly in the output image so that while my close friends can correctly identify me but the same task will be hard for computer databases that have millions of faces. This can be achievable because we, as humans, can only encode a small number of facial features in our long-term memory: the size of extracted facial features for the human brain should be way smaller than that for a computer. This way, if all faces can be slightly modified right after taking the picture, the entire Internet would contain fake faces which will make face searcher harder.

Now, think about it. If Facebook (ironically “face”-book, now got a better name), Google, or Apple can implement algorithms like this on their user frontend before any data got transmitted, the majority of images online will become unsearchable.

duq-VQGAN+CLIP

I spent quite a while trying to get the VQGAN+CLIP site to work, but I was completely unsuccessful. I instead used the Pixray readymade. I was surprised by how unrelated to the prompt the first couple of images were and then I found it to be a window into the way in which ML works to see how it takes that image and slowly makes it fit closer and closer to the prompt.

Solar-facereadings

When reading about Más Que La Cara Overview, I was surprised to learn about the small intricate details of considering the software and tools to successfully create a public, interactive installation which are often overlooked. It is critical to consider factors such as the exposure of light on the face influence facial detection in public spaces. The importance of this consideration relates to the fact that most facial recognition software are not racially diverse which surprised me. It was also incredibly humorous to me when watching Face Recognition and learning about how the limitations of facial recognition software led police to use actor Woody Harrelson’s face which was a look-alike to the thief in order to catch the thief.

Koke_Cacao-VQGAN+CLIP

Koke_Cacao-VQGAN+CLIP

(05-VQGAN+CLIP)

Using: wikiart_16384 + ViT-B/32 + default paremeter
Prompt: A student suffering from his coding homework

100 iterations
100 iterations
300 iterations
300 iterations

The image quality is not very good as other synthesizers based on image inputs (compared to models related to style
transfer) since the natural language processing pipeline restricted the latent space or because they are not trained
end-to-end. The style of the images generated can get very cliche very soon and the tool doesn’t give the artists
very much control over the generated image.

Koke_Cacao-TextSynthesis

Koke_Cacao-TextSynthesis

(05-TextSynthesis)

inferkit
narrative-device

Using: Narrative Device
Themes: coding, homework
Result: The teacher looked up from his computer screen and saw that his student had not yet arrived. “I’m going to
have to start grading your homework,” he said, “but I want to make sure you are doing it right.”
Comment: This has nothing to do with coding homework. The model interpret coding as teacher’s computer
screen. The result might by highly “plagiarized” from a story book.

Using: inferkit
Result:

The teacher looked up from his computer screen and saw that his student had not yet arrived. "I'm going to have to start grading your homework," he said, "but I want to make sure you are doing it right."

The student sighed, rolled his eyes, and said, "There's no point.

This is the ninth grade.

You'll do whatever I say and there's nothing I can do about it. "He sat down, and opened his book.

His teacher put his name in the "Now On and shook his head.

"I get to do this each and every night. Why can't you do it during the day?"

"Well," the teacher said, "when I set my alarm clock for noon, my friend buys me coffee.

I set it back to 5: 30 and I'm still awake.

It's got to be midnight by the time you go to bed.

There's no point in going to sleep."

The student sighed and closed his book.

Comment: The model only seems to pay attention to the previous 2 sentences and major characters. Although attention (to
characters) is all you need, but it is hard for the model to generate a global level theme of the text that is
consistent and relevant.

Koke_Cacao-ArtBreeder

Koke_Cacao-ArtBreeder

(05-ArtBreeder)
ArtBreeder
ProsePainter

Genre Picture
Genre Picture
Genre Parameter
Genre Parameter
Breeder Picture
Breeder Picture
Breeding Parameter
Breeding Parameter

The art breeder model is very cool. It magically preserves the lower-level pattern of oil paintings with relatively
high resolution. This quality is often, at least to me, hard to achieve and requires a lot of hyperparameter tuning
and GPU power. The fact that it can run on the browser this fast is amazing (is it running on a browser though? Or
does it use backend GPU with querying?). Also, it is very rare to have a user interface that allows both breeding
and gene-editing. I wonder if “breeding” means parameter breeding rather than pixel-wise breeding.

Koke_Cacao-Pix2Pix

Koke_Cacao-Pix2Pix

(05-Pix2Pix)

Phone Cat
Phone Cat
Phone Cat with Black Screen
Phone Cat with Black Screen

I have played this model years ago. The model tries very hard to fit both low-level pattern and border structure to
the input data, which creates funny-looking images. But the model is only good at interpreting thin lines, with
equal stroke weight and without color.