Blog

Category: HAL149

The real AI is about your time

The real AI thing will be some kind of AI agent that can handle all of your online digital interactions, including but not limited to:

– Managing the agenda,
– Writing and answering emails,
– Web browsing and online transactions,
– Summarising and commenting on news stories
– Taking part in meetings and phone calls, etc.

Imagine having an AI agent that efficiently takes care of all your online interactions, meeting with it for 20-30 minutes a day to give further instructions or get updates, and having the rest of the day to think or do whatever you want.…

People expect ChatGPT to be as accurate as an encyclopaedia and as comprehensive as Google. But that’s a misconception. ChatGPT is trained to create content, not to tell the truth.

GPT models require fine-tuning, embedding, plugins, etc. on top of the base model to provide reliable information.

The real added value of ChatGPT lies in the language skills: for the first time you interact with a machine in natural language.…

The Moving Target of Technology

Everyone is confused and stressed about what to build on AI because everyone is focused on a moving target: technology.

No one is talking about what companies really need. The real challenge is not the AI models, the blockchain, the apps and so on, but how to create value.

Technology is a moving target and will remain so. Its only part of the equation: the fundamental part is people.…

The Layer 2 Business Opportunity

In this insightful talk, Sam Altman emphasises that only a few companies are likely to have the resources to build and maintain Large Language Models (LLMs) such as GPT-3. However, he foresees the emergence of many “layer two” companies with valuations in excess of a billion dollars over the next decade.

“Layer two” is referring to companies built on top of fine-tuned base models that unlock efficiency and progress in domain-specific industries.…

Using AI for Product Ideation

An interesting take on a very interesting topic: can we get creative, valuable ideas from generative AI?.

Some weeks ago I wrote about the core of this problem: finding “what we don´t know that we don´t know“.

But could AI generate the next billion-dollar business idea? Product ideation is the best place to start. If you only have vague initial ideas, generative AI can really help to crystallise them.…

GPT-You: The Last Mile in GPT Models

Achieving real value depends on moving from theoretical models to production-level accuracy. This shift requires investment in data tagging and development.

Specialised, fine-tuned models consistently outperform generic counterparts such as ChatGPT, and outperform alternative approaches such as zero-shot learning and prompt-based methods, as shown in studies.…

Is it worth creating new products on top of ChatGPT?

The fact that OpenAI has just released enough updates to destroy a lot of AI startups and plugins should make everyone think.

Does it still make sense to build products on top of the platform? Why should you invest time and money in creating new products that could soon be cannibalised by these people?

Yet it does make sense why? because most users wont get into paid options, not to mention complex configuration menus.…

Google Brain cofounder says Big Tech companies are lying about the risks of AI wiping out humanity because they want to dominate the market

Its called “regulatory capture”.

Statistics on numerical hallucinations with GPT

If you try in ChatGPT:

“what is the numeric value of 2 * ( 5 * 2 ) replacing ‘*’ by the addition operator?”

You will mostly get wrong answers.

A simulation of 25 questions / answers using davinci 3, gets 50% of the answers right and wrong.

Answer: ’20’ – Count: 13
Answer: ‘2 + 5 + 2 = 9′ – Count: 9
Answer: ’22’ – Count: 2
Answer: ’14’ – Count: 1

Results show how the machine always remembers the expression but in half of the answers it has forgotten the basic premise.…

Business idea generator with 5 parameters

For this business generator, we will use up to five parameters to test the model’s ability to generate creative and innovative business ideas.

Parameters will include:
– Industries: the primary areas of activity, including clean energy, health, and others.
– Models: revenue models, such as subscription, e-commerce, and more.
– Funding: Stage of project funding, including pre-seed, seed, bootstrapped, etc.…

Generador de Ideas de Negocio

Vamos a utilizar un poco de código y la plataforma de ChatGPT (OpenAI) con el fin de crear un generador de ideas de negocio que poder utilizar en HAL.

Para mantener una homogeneidad en los resultados y poder utilizarlos a su vez en un posible fine-tuning vamos a considerar 3 parámetros fundamentales, por simplicidad serán:

– Actividad o industria: salud, talleres mecánicos, inmobiliaria
– Ubicación: Madrid o Valencia
– Financiación: Sin financiación, con 500K de capital semilla.…

Machine Learning and the Data Gold Rush

For the last 200-300 years there’s been something called regression statistics, a regression algorithm that relates known, pre-defined things (‘today is Friday’) to knowledge about other things (‘you use LinkedIn’).

But with machine learning we get into Bayesian algorithms, which means you don’t need a human to pre-define what’s important.…

The most interesting finding of the AI revolution is about learning

Artificial intelligence finds and builds patterns in data. But arranging alphabetical symbols in a way that makes sense to us does not imply intelligence or a sense of meaning.

A stable diffusion model helps you design cars or houses, but it has no idea of the real meaning of those shapes. It goes no further than knowing that they belong to a particular archetypal class or ‘latent space’.…

Chat with PDF Documents using Embeddings

Goal is to convert lengthy documents into numerical representations (vector embeddings) and store them in a vector search engine.

When users engage in conversation with the documents, the system employs Approximate Nearest Neighbor search to find and return relevant text responses. This is achieved using:

– OpenAI’s cost-effective ChatGPT API (gpt-3.5-turbo)
– The vector database, Chroma, is suitable when used alongside LangChain for building applications with Large Language Models (LLMs).…

AI: The real value is in what we don’t know that we don’t know.

Most of the queries that people and businesses make to chatgpt are to retrieve known data that can be found online. And so we expect it to provide real answers to real, concrete questions.

But knowing what the network knows, or even what it ‘knows it doesn’t know’, doesn’t really make a competitive difference: there are lots of people and resources working on it.…

Custom ChatGPT is about prompt engineering

Most people assume that we can train or customise ChatGPT with our own data. But while the end result looks like this, things are fundamentally different. Technically no one can train ChatGPT on your data. OAI doesn’t have an option for it.

At the root of this issue is that each chatgpt thread or API request starts a new conversation, which means the model has no natural memory of conversations.…

The Future is About Transformation, not Disruption

Every technological revolution creates opportunities because it reshuffles the deck. But the rules of the game are the same: it’s always about human needs.

ChatGPT is great compared to what we had before, in the same way that a car is better than a horse: its optimal for certain tasks under certain constraints.

But it is not a one-stop solution for all business problems or some kind of oracle that knows everything.…

And then pessimism became a competitive sport

“AI is likely to eliminate jobs without creating new ones … Goldman Sachs warned in March that AI would cost the world 300 million jobs, a quarter of the global workforce”.
https://lnkd.in/dpFhY_Yx

Biased and anti-human analysis from the big players, as usual.

Here’s another prediction from me:

“Unless power structures, education, culture and society in general change, AI will replace 100% of jobs, including the entire Goldman Sachs organization.”…

If you are a content publisher, get used to the idea that the web and SEO (as we know it) will soon disappear.

What is the point of publishing structured and well-designed information if some kind of intelligent API is going to mix it up and present it at the user’s convenience?

People will simply prefer to interact with a single AI agent and very few will continue to go to old websites to read or make transactions.…

ChatGPT does not lie because it does not know the truth

All the “hallucinating” and “AI lies” chatter is based on one big misunderstanding.

The fundamental goal of GPT is for machines to have language skills so that they can interact with you like a real person.

They have been trained on billions of web pages to create content, not to answer accurately every possible question (and there are technical and philosophical reasons for this).…

Is Google Going to Become the New Yellow Pages?

Google’s current business model is primarily focused on links and clicks, rather than on AI technology. This is based on the assumption that users interact with the WWW, which is a network of websites and links.

However, as AI assistants become more advanced they will start working as augmented search engines, capable of mimicking human interaction, summarizing and presenting information in a fully customized way, with:

– No ads
– No SEO-optimized article crap
– No cookie banner
– No popups, no usability tricks, etc.…

Laplace’s demon incarnated as Stephen Wolfram

This is a sample of how geniuses can also talk nonsense.

Stephen Wolfram completely abduced by Laplace’s demon, determinism, and other 300 years old ideas. This man basically ignores the Uncertainty Principle together with the rest of quantum and chaos theory from the last 100 years.

Its not that molecules trajectories “look” random, its that they are intrinsically random.…

The universe is mostly computationally irreducible, which means it can only be explained by itself. ML models are good at explaining and predicting very limited parts of reality; language has been shown to be one of them, but not human cognition.

The idea of entropy in marchine learning

Machine learning is fundamentally about reducing entropy: finding the basic parameters that define a set of observations so we can either predict or generate new observations.

Reducing entropy can be seen as building a model that explains data. By reducing entropy, we aim to capture the underlying patterns and structure in the data, allowing us to better understand and explain the observed phenomena.…

Embedding “Don Quijote” in GPT 3.5

Here’s some code for chatting and asking questions about the first book of Don Quixote (or any other long pdf, for that matter).

It is about 500 pages in .txt format.

Need to check what´s wrong with langchain CharacterTextSplitter when using texts in SpanishI I created a provisional script to split the text with just length criteria.