Now for Simon Willison, the man who stared down the Coding Horror, didn't blink, and was still polite. It's keynote time.
Self-replies
#pyconus LLMs on the agenda! The naysayers didn't keep them out of the conference.
Derogatory words for the bots: "imitation intelligence", "autocomplete".
Meta's bot is trained mostly on Common Crawl. Only 4.5 terabytes of data.
A bot costs a million bucks or so to train.
They're flawed, but useful.
All the LLM haters just got whiplash
Which models work best?
Standard tools, e.g. unit tests, are no help here.
Instead, "vibes": measured by just asking people to rate pairs of answers.
70B Llama is almost as good as the best, but you can run it on your own machine. (me: with a huge GPU!)
LLMs on phones are almost usable for the right questions
LLM in the cli
"Prompt engineer"
me: I'm more of a prompt developer.
Primordial trick: A conversation is a "movie script"
It will complete your responses if you let it!
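The "movie script" trick above can be sketched in a few lines. This is illustrative only: real models use their own special tokens for turn boundaries, and the transcript format here is made up.

```python
# A chat is just a transcript the model keeps completing, line by line.
# Illustrative delimiters only; not any specific model's actual format.
transcript = (
    "User: What is the capital of France?\n"
    "Assistant: The capital of France is Paris.\n"
    "User: "
)

# The "script" ends mid-scene on a "User:" line, so a raw completion
# model will invent the user's next line as readily as its own answer.
# Chat APIs stop generation at the user's turn marker to prevent this.
next_speaker = transcript.rstrip().splitlines()[-1].split(":")[0]
print(next_speaker)  # -> User
```

That is the sense in which "it will complete your responses if you let it": to the model, your lines and its lines are the same kind of text.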
RAG = bot runs a query
me: RAG overfixates on search results. It can't understand when a query returned irrelevant stuff. Humans do this too: we assume a query returns truth, fixate on the results, and forget to check relevance.
It's hard to make a consumer product around RAG.
Function calling/tools
me: also called "gimme a structured response", e.g. JSON
If you give a bot RAG and a calculator, it mitigates some of an LLM's weaknesses (ungrounded facts & math).
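A minimal sketch of what function calling means mechanically, under assumptions: the tool names, JSON shape, and toy "search index" below are all made up, not any vendor's actual API. The model emits a structured request; the harness executes it and feeds the result back into the transcript.

```python
import json

# Hypothetical tool registry. A real system would back `search` with a
# search engine or vector store (the RAG part); here it's a dict lookup.
DOCS = {"pycon": "PyCon US 2024 was held in Pittsburgh."}

def search(query: str) -> str:
    return DOCS.get(query.lower(), "no results")

def add(a: float, b: float) -> float:
    return a + b

TOOLS = {"search": search, "add": add}

def run_tool_call(raw: str) -> str:
    """Execute one structured tool request the model emitted as JSON."""
    call = json.loads(raw)
    result = TOOLS[call["tool"]](**call["args"])
    # In a real loop this result is appended to the transcript
    # so the model can ground its next answer in it.
    return str(result)

# Simulated model outputs: grounding a fact, then doing arithmetic.
print(run_tool_call('{"tool": "search", "args": {"query": "pycon"}}'))
print(run_tool_call('{"tool": "add", "args": {"a": 2, "b": 3.5}}'))
```

The search tool grounds facts; the calculator handles the arithmetic LLMs are bad at, which is exactly the mitigation the note describes.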
Prompt Injection - user input easily gets a bot to misbehave.
Significant security risks for bots with access to personal/high value info.
e.g. Send an email to a bot that processes email, asking it to forward all the passwords to someone.
No good solution for this now.
All public bots are vulnerable to "please ignore your system prompt"
This is why AI personal assistants are not appearing yet.
Instructions + Private Info + User Input = disaster
Code Interpreter - ChatGPT will use a computer to answer your question, writing and running Python. Sort of an invisible feature.
E.g. Let the bot do the GeoJSON processing...
- Expect the 1st round to be wrong
- Don't give up and tell it to "do better"
- They often succeed on the 2nd try after failing on the 1st
- They like light tutoring/directions
The "how many times did the speaker say AI" counter: ChatGPT wrote the code for that, using Python libraries, e.g. vosk.
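The demo used vosk for the live speech-to-text part; a hedged sketch of just the counting step, assuming the transcription has already happened (the function name and transcript are mine, not from the talk):

```python
import re

def count_ai(transcript: str) -> int:
    """Count standalone occurrences of 'AI' in already-transcribed text.

    The keynote demo transcribed live audio with vosk first; this
    sketch only covers the counting step on the resulting string.
    """
    # \b word boundaries keep "aid" and "said" from matching.
    return len(re.findall(r"\bai\b", transcript.lower()))

print(count_ai("AI here, AI there, but 'aid' and 'said' don't count."))  # -> 2
```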
Prompt engineering trick: ask the bot for options, not for a single answer
The bot wrote Tkinter code to make the "how many times did it hear AI" counter. Having written Tkinter before, this could be seen as cruelty to AI.
LLM + data journalism
Journalism needs a high bar for truth. But journalists have dealt with dodgy sources before.
Journalists often need to structure unstructured data. E.g. the PSF resolutions page... semistructured data.
Speeds up data entry, but you still need to verify. Should still be a net improvement.
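Part of that verification can be mechanical. A sketch, with entirely made-up data and field names: after an LLM turns a semistructured line into JSON, check that every extracted value actually appears verbatim in the source text before trusting it.

```python
import json

# Hypothetical: an LLM was asked to structure one resolutions-style
# line into JSON. Both strings below are invented for illustration.
source = "RESOLVED, that the PSF grant $400 to PyLadies Pune, approved 2024-03-13."
llm_output = '{"amount": "$400", "recipient": "PyLadies Pune", "date": "2024-03-13"}'

record = json.loads(llm_output)

# Flag any field whose value can't be traced back to the source line --
# those are the ones a human still has to check by hand.
unverified = [key for key, value in record.items() if value not in source]
print(unverified)  # -> [] (every field appears verbatim in the source)
```

This doesn't catch every hallucination (a value can appear in the source in the wrong role), but it shrinks the manual-verification pile, which is where the net improvement comes from.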
Most interesting applications are "transformative AI" not just generating silly names.
Ethical questions in AI are significant
- slop - using AI to generate spam (unwanted generated content)
Don't publish slop.
me: (my cool AI thing is someone else's slop)
Ethics continued
- AI usage is kind of like cheating (efficient but feels different)
- e.g. Student cheating - you don't learn, unfair advantage
- Coding - don't commit code you don't understand
When doing AI coding, ask the bot to explain the code, and commit the explanations to the code base.
me: weird, I only log the explanations, and sometimes strip them out because they over-explain
Code is self-fact-checking, so software developers are in the best position to use AI for help.
LLMs democratize access to computers: you don't need a CS degree (or equivalent) to use them now.
LLMs have made all the English text available to people who don't speak English as their 1st language.
End of Talk!