Tag Archives: aiml

A comprehensive list of 2024 AI predictions 👽

The global AI market was already worth more than US$150 billion by the end of 2023. According to one of the reports, the global AI market will reach US$1350 billion by 2030, and this upward journey surely begins from 2024. The idea of this icon 👽 might have set the tone of my predictions for this year. Trust me, we are going to witness unimaginable AI implementations in 2024. The breakthroughs of Generative AI in 2023 has setup a dramatic momentum for 2024, and our expectations have risen to a next level. Everyone is waiting for the “Aliens” to appear this year, I mean not literally, but I guess you understand the sentiments.

Before I start, there are pretty obvious things which are going to happen in 2024, like OpenAI‘s GPT-5 will be launched, Generative AI will become a technology risking most jobs by any tech-disruption, and on contrary setting up stage for people with plethora of opportunity in new job-roles — like prompting efficiently. And the start of the year will face a dramatic AI startup-stress because of the business models that are too much affected by OpenAI’s release of add-ons.

“It has become appallingly obvious that our technology has exceeded our humanity” — Albert Einstein

I am very optimistic about this year, but at the same time I am cognizant of the fact that this year is also going to daunt us a lot. And its because we haven’t yet lifted ourselves to the maturity which this technology demands, and I am specifically concerned with the pace of Generative AI’s access to common people in its raw form. So to start with, the table of contents below should clear how I am picturing this for the year 2024.

2023 AI Wrap-up (auto9mous Newsletter)

Its amazing to see how AI adoption happened in the year 2023. With all the Generative AI use cases and new products, last year felt like evolving at an unprecedented speed. Reading through all the stories and posts by AI Influencers, Entrepreneurs, Domain Experts, and literally “Common People“, was so overwhelming. I would call the year 2023, an year of defining the new AI tech-disruption of this decade…

How Responsible a Generative AI Can Be, If We, The Humans, Are Irresponsible?

In the midst of the Fourth Industrial Revolution, as we increasingly integrate Generative AI into our daily lives, we face a critical paradox: Can we expect a machine, inherently void of morals or consciousness, to be responsible if we, the architects and users, sometimes falter in our own responsibilities?

Generative AI systems, like OpenAI’s GPT series or the imaginative Midjourney, have not just demonstrated capabilities to create text or images but have also exemplified the power to inspire, innovate, and occasionally intimidate. Trained on vast troves of data, they’re a mirror, reflecting the collective knowledge, biases, and intentions of humanity.

Before we delve deep, let’s set the context:

Real-world Scenario: In 2020, generative models birthed ‘deepfake‘ technologies, a double-edged sword capable of creating realistic yet entirely synthetic media. While artists found new avenues for creativity, malicious actors found ways to spread misinformation, impacting political landscapes and individual lives.

“A tool is but an extension of one’s hand, an AI is an extension of one’s mind. Both amplify intent; neither possess their own.”

Generative AITypical UsagePotential Misuse
GPT-4Content Creation, Customer SupportSpreading misinformation
MidJourneyImage GenerationCreating misleading imagery

To visualize the evolution and potential implications of Generative AI, consider this simple flowchart:

This blog will uncover the mechanics of Generative AI, examine the landscape of human responsibilities, and ascertain whether there’s a ceiling to how responsible an AI can truly be. But remember, every tool, even AI, requires judicious and mindful human use. The question isn’t just about what AI can do, but more crucially, what we do with AI.

Continue reading How Responsible a Generative AI Can Be, If We, The Humans, Are Irresponsible?

Leveraging NLP and OCR for Business Card Text Extraction

In the age of digital transformation, where every piece of information is becoming rapidly accessible and organized, business cards remain one of the few tangible pieces of professional information exchange. While their physical form offers a personal touch, extracting information from them in a quick and efficient manner poses a unique challenge. To address this I have thought to write my approach for business card text extraction in the best possible manner.

Deep Dive into the Mechanisms, Challenges, and Innovations of Business Card Text Extraction
Deep Dive into the Mechanisms, Challenges, and Innovations of Business Card Text Extraction

In the powerful combination of Natural Language Processing (NLP) and Optical Character Recognition (OCR), NLP enables machines to understand and respond to human language. On the other side, OCR technology converts different types of documents, including scanned paper documents, PDF files, or images taken by a digital camera, into editable and searchable data.

In this blog, we will delve into an innovative method that combines the strengths of both NLP and OCR, specifically the renowned Tesseract-OCR tool, to extract and categorize information from business cards. From identifying specific phone numbers such as office, fax, or mobile numbers to precisely extracting detailed address components like city, state, and country, this technique has shown great potential in revolutionizing the way we process business cards. Join us as we unravel the intricacies of this method and explore its future implications.

Extraction StepDescriptionExample
Optical Character Recognition (OCR)Conversion of images of typed, handwritten, or printed text into machine-encoded text.Image of “John Doe, CEO, XYZ Corp.” ➔ Text: “John Doe, CEO, XYZ Corp.”
Extracting and Classifying Phone NumbersIdentifying various phone number types based on prefixes.Text: “Office: 123-456-7890” ➔ Classified as “Office Number”
Precision Address ExtractionParsing the text for precise extraction of address details.Text: “123 Maple St., Springfield, IL 62704” ➔ Extracted as: Street – “123 Maple St.”, City – “Springfield”, State – “IL”, Zip Code – “62704”
Connected Component AnalysisIdentifying regions of interest for block processing based on pixel connectivity.Image with “John” written closely, and “Doe” spaced apart ➔ Two components: “John” and “Doe”
Future Avenues for Business Card ExtractionPotential advancements in the extraction process.Using Deep Learning for better contextual understanding.
Personal Details ExtractionParsing text for personal names, designations, and organizations.Text: “Dr. Jane Smith, Cardiologist, HealthCorp” ➔ Extracted as: Name – “Dr. Jane Smith”, Designation – “Cardiologist”, Organization – “HealthCorp”
Phone Numbers ExtractionRetrieving various phone numbers.Text: “Fax: 987-654-3210” ➔ Extracted as “Fax Number”
Address ExtractionParsing for the full address.Text: “456 Elm St., Suite 7A” ➔ Extracted as “Address”
Zip Code ExtractionIsolating and extracting postal codes.Text: “… Springfield, IL 62704” ➔ Extracted as “Zip Code – 62704”
City ExtractionIdentifying and extracting city names.Text: “… Springfield, IL …” ➔ Extracted as “City – Springfield”
State ExtractionPinpointing and extracting state names or codes.Text: “… Springfield, IL …” ➔ Extracted as “State – IL”
Country ExtractionRecognizing and extracting country names.Text: “… USA” or “… United States of America” ➔ Extracted as “Country – USA”
Step-by-Step Breakdown of Business Card Information Extraction Processes with Examples
Continue reading Leveraging NLP and OCR for Business Card Text Extraction

Transforming Supply Chain Automation through Cognitive Robotics

Cognitive Robotics, a harmonious blend of AI, Machine Learning, and robotics, signifies the dawn of a new age in numerous industries. By infusing robotic systems with capabilities such as understanding, learning, and autonomous decision-making, cognitive robotics sets the stage for an extraordinary level of supply chain automation.

Existing automation technologies have already improved supply chain efficiency, minimizing labor costs, lead time, and error rates while enhancing productivity. With cognitive robotics, the industry is on the brink of a new era, wherein robots are not merely task performers but cognitive entities capable of decision-making.

Continue reading Transforming Supply Chain Automation through Cognitive Robotics

AIML in Public Safety & Disaster Scenario

The use of artificial intelligence and Machine Learning (AIML) systems can be an effective method for automating disaster response, but they need to be properly trained to interpret disasters for them to be useful.

Guest talk at the Indian Institute of Technology, Guwahati zoom-live session on the launch of the 2022 batch of AIML certification, addressed the questions like;

  1. How AIML is helping this world to be a safer planet for a living?
  2. How big is this disaster problem?
  3. How humans have become intelligent over years by using Artificial Intelligence and Machine Learning to handle Disasters?
  4. How the wildfire in technologically advanced countries is getting handled or maturing to get ready to handle?
  5. How drones are helping to fight disasters?
IIT-Guwahati Talk: Artificial Intelligence & Machine Learning in Public Safety & Disaster Scenario
Continue reading AIML in Public Safety & Disaster Scenario