Lightning Talk - CONDG

Central Ohio .NET Developer's Group

July 2024

Meetup: meetup.com/Central-Ohio-NET-Developers-Group-CONDG

YouTube: youtube.com/@CONDG

 


 

Lightning Talk

Lightning Talk image 1

Prompt: Create a cartoon image representing the term "lightning talk."

 


 

Lightning Talk?

Lightning Talk image 2

 


 

Blitz Babble

Blitz Babble image

Prompt: Replace "lightning talk" with "blitz babble."

 


 

whoami

Bill Melvin

 


 

 


 

Good Morning

Your stuff is floating

Picture of flooded cellar

 


 

Pump out the water

Picture of pumping out the water

 


 

Aftermath

Picture of workbench

 


 

The Sad Case of the Drowned Books

Drowning books image

 


 

Books do not like water

Picture of books in workbench shelf

 


 

Books were not the priority

Picture items drying

 


 

Books did not fare well

Picture of damp books on shelf

 


 

 


 

Time to Recycle

Moldy book cover

Moldy books are not appealing

 


 

Before Recycling

"I know that book is around here somewhere..."

Took pictures, for reference, in case I want to know if a book I remember having was lost in the not-so-Great Cellar Flood of 2019.

 


 

 


 

Catalog the books

Rather than having to look through all the pictures, a list of the books would be more convenient.

Why spend a few hours doing a thing by hand when you can spend days automating it?

 


 

Tesseract OCR

Optical Character Recognition

GitHub: tesseract

Per Wikipedia:

Tesseract is an optical character recognition engine for various operating systems. It is free software, released under the Apache License. Originally developed by Hewlett-Packard as proprietary software in the 1980s, it was released as open source in 2005 and development was sponsored by Google in 2006.

In 2006, Tesseract was considered one of the most accurate open-source OCR engines available.

 


 

Use Python to automate Tesseract

 


 

Process images for better results

When the pictures were taken, using them for OCR was not considered.

Many of the images have poor focus, and the books are moldy and wrinkled.

Some initial processing on the images was done to try to help the OCR process.

The Pillow package, a fork of the Python Imaging Library, was used for image processing.

 


 

Make sharpened color and grayscale versions

Used a Python script to create two versions of the original image, both sharpened and scaled, and one converted to grayscale. The script read the list of images from a .csv file and applied the following to each row (some interim steps skipped):

prep_images_1.py

 


 

Tesseract results

IMG_0640

Ss Foreword by

Hitchhiker's Guide
to Visual Studio and
SOL Server

Seventh Edition

=
5

ati

oe ie
bit SIR ail

 


 

 


 

Matt Eland

Matt Eland image

Matt Eland has websites. Here are a few of them:

 


 

Matt Eland's CONDG Talk

On 2022-08-24, Matt gave a talk called Automating my Dog with Azure Cognitive Services

Matt Eland CONDG talk

 


 

 


 

Try Azure OCR

 


 

Create Azure Resource

Documentation Links

Note: Microsoft has renamed Form Recognizer to Document Intelligence.

 


 

Document Intelligence / Form Recognizer (1)

screenshot

 


 

Document Intelligence / Form Recognizer (2)

screenshot

 


 

Document Intelligence / Form Recognizer (3)

screenshot

 


 

Document Intelligence / Form Recognizer (4)

screenshot

 


 

Console Application

Console application does the following:

Source code is on GitHub: wmelvin/try-azure-ocr

 


 

NuGet Packages

AzureOCRConsoleApp.csproj

csproj-1.xml

 


 

Settings File

azure-ocr-settings.json

settings-1.json

 


 

Settings Class

Settings.cs

settings-2.cs

 


 

Settings Method

settings-3.cs

 


 

Create Credential and Client

Create an AzureKeyCredential and DocumentAnalysisClient using the key and endpoint.

client.cs

 


 

Process Image

process-image.cs

 


 

Form Recognizer results

IMG_0640

Foreword by Bob Muglia
Windows Server System
Hitchhiker's Guide
to Visual Studio and
SQL Server
Seventh Edition
Best Practice Architectures and Exames
AUTHORITATIVE GUIDES FOR MICROSOFT WINDOWS AND SERVER PROFESIONALS
William R. Vaughn
with Peter Blackburn

 


 

Tesseract results (for comparison)

IMG_0640

Ss Foreword by

Hitchhiker's Guide
to Visual Studio and
SOL Server

Seventh Edition

=
5

ati

oe ie
bit SIR ail

 


 

What's Next?

I didn't make it as far in this exploration as I had hoped prior to this talk.

What I have done so far uses tools that recognize text in the images.

There are now tools, leveraging LLMs, that can recognize the books in the images.

I expect those tools can provide much better recognition and results than anything I have tried so far.

 


 

Simon Willison

GitHub: github.com/simonw

Detecting books in video

YouTube: Using AI for Journalism

Blog Post: The killer app of Gemini Pro 1.5 is video

 


 

Anders Hejlsberg

Wikipedia: Anders Hejlsberg

Interview on YouTube: From Turbo Pascal to Delphi to C# to TypeScript...

 


 

Hanselminutes

Hanselminutes: Introducing TypeChat with Anders Hejlsberg

Podcast screenshot

 


 

TypeChat

Blog: Introducing TypeChat

GitHub: TypeChat

Typechat screenshot

To be continued...

 


 

Books I was sorry to lose

 


 

The Firebird Book

I used the Firebird database server on the job for an internal application. It just worked - as they say.

book image Firebird

 


 

Turbo Pascal Techniques and Utilities

This book was very helpful when I was making a living programming in Turbo Pascal.

book image TPTech

 


 

The C Programming Language

I never did much C programming, but this book was a classic.

book image KandR

 


 

Microcontroller Handbook

Early in my programming career, I wrote some microcontroller assembly language. Besides being a necessary reference, these manufacturer's handbooks were somehow fascinating.

book image Micro

 


 

Programmer's Guide to the IBM PC

The Peter Norton classic. Very useful back when knowing interrupt request (IRQ) numbers mattered.

book image Norton

 


 

Programming Windows

The Charles Petzold classic.

book image Petzold

 


 

REBOL for Dummies

I used REBOL on the job for one thing, once, a long time ago.

book image REBOL

 


 

That's All

Thank you

wmelvin.dev