cj
a year ago
This is cool, but I wish it were integrated into tools already used for coding and writing rather than having it be a separate app.
This also demonstrates the type of things Google could do with Gemini integrated into Google Docs if they step up their game a bit.
Honestly I’m scratching my head on OpenAI’s desire to double down on building out their consumer B2C use cases rather than truly focussing on being the infrastructure/API provider for other services to plug into. If I had to make a prediction, I think OpenAI will end up being either an infrastructure provider OR a SaaS, but not both, in the long-term (5-10 yrs from now).
jcfrei
a year ago
When they are focusing on just being an API provider then they will be in a market with (long term) razor thin margins and high competition - most likely unable to build a deep moat. But if you can shape customers habits to always input "chatgpt.com" into the browser whenever they want to use AI then that's a very powerful moat. Those customers will also most likely be on a subscription basis, meaning much more flexibility in pricing and more rent for openAI (people using it less then what OpenAI calculates for subscription costs).
james_marks
a year ago
I agree, and it’s why I have come to dislike OpenAI.
We are getting front row seats to an object lesson in “absolute power corrupts absolutely”, and I am relieved they have a host of strong competitors.
adventured
a year ago
The difference between Google had it just tried to be an enterprise search API, versus owning the consumer destination for search input/results.
cynicalpeace
a year ago
Google will be a remembered as a victim of Schumpeter's Creative Destruction
james_marks
a year ago
From Wikipedia, for that don’t know the term: “a concept in economics that describes a process in which new innovations replace and make obsolete older innovations.”
Ironically, I had to google it, and agree with the comment.
LifeIsBio
a year ago
I'm usually pretty verbose with prompts, so I wanted to see what chatgpt would do with just a "Schumpeter's Creative Destruction"
https://chatgpt.com/share/66ff28e2-ea74-800b-a230-86d562f60f...
james_marks
a year ago
Not bad. I do love the ability to share a chatGPT thread. I haven’t found that yet in Claude and would use it.
satvikpendem
a year ago
You should read The Innovator's Dilemma as well, as it goes into detail on this concept, basically explaining why and how technological disruption occurs from the point of view of the disruptor and disruptee.
movedx
a year ago
This sounds like a good way of guaranteeing profits and moving the mindset away from just producing good products.
aixpert
a year ago
Your argument, which could be correct, makes their choice of the name ChatPGT* even more idiotic
* Or which ever variant the average user might try to type in
svat
a year ago
> the type of things Google could do with Gemini integrated into Google Docs
Google already does have this in Google Docs (and all their products)? You can ask it questions about the current doc, select a paragraph and ask click on "rewrite", things like that. Has helped me get over writer's block at least a couple of times. Similarly for making slides etc. (It requires the paid subscription if you want to use it from a personal account.)
https://support.google.com/docs/answer/13951448 shows some of it for Docs, and https://support.google.com/mail/answer/13447104 is the one for various Workspace products.
Zinu
a year ago
Those look more like one-off prompts, and not a proper chat/collab with Gemini.
svat
a year ago
That's there too; see https://support.google.com/docs/answer/14206696 — you can click on the "Ask Gemini ⟡" and carry on a conversation, e.g. "summarize emails about <topic>" and use those to paste into the doc. (I haven't found all that much use for referencing other files though. But the "proper chat" is useful for saying things like "no actually I meant something more like: …" and carrying on.)
al_borland
a year ago
I wouldn't be surprised to see Apple add something like this to Pages and some of their other apps. Their approach to AI, from what we've seen so far, has been about integrating it into existing apps and experiences, rather than making a separate AI app. I have to imagine this is the way forward, and these stand alone apps are basically tech demos for what is possible, rather than end-state for how it should be consumed by the masses.
I agree with you on where OpenAI will/should sit in 5-10 years. However, I don't think them building the occasional tool like this is unwarranted, as it helps them show the direction companies could/should head with integration into other tools. Before Microsoft made hardware full time, they would occasionally produce something (or partner with brands) to show a new feature Windows supports as a way to tell the OEMs out there, "this is what we want you to do and the direction we'd like the PC to head." The UMPC[0] was one attempt at this which didn't take off. Intel also did something like this with the NUC[1]. I view what OpenAI is doing as a similar concept, but applied to software.
acchow
a year ago
Every app with a significant installed user base is adding AI features.
OP is lamenting that Cursor and OpenAI chose to create new apps instead of integrating with (someone else’s) existing apps. But this is a result of a need to be always fully unblocked.
Also, owning the app opens up greater financial potential down the line…
rty32
a year ago
How many people use Pages these days? I don't think Apple even mentions the product in their WWDC these days. My guess is that most people either use Microsoft suite as required by their employer or use cloud based knowledge base/notes tools like Notion/Quip/Obsidian/Confluence etc. I doubt Apple thinks it worthwhile to invest in these products.
al_borland
a year ago
People who need to make the occasional document outside of work, who don’t need to invest in paying for Office, use iWork. I count myself in that list. I use Office at work (99% of that usage is Excel), but at home I use the iWork apps. Mostly Numbers, but Pages as well. I hear many of my friends and family doing the same, because it’s what they have, it’s good enough, and it’s free.
Few people outside of tech circles know what those other apps you mentioned are. I use Confluence at work, because it’s what my company uses. I also tried using it at home, but not for the same stuff I’d use Pages for. I use Obsidian at work to stay organized, but again, it doesn’t replace what I’d use Pages for, it’s more of a Notes competitor in my book. A lot of people don’t want their documents locked away in a Notion DB, and it’s not something I’d think to use if I’m looking to print something.
I went back and looked at the last WWDC video. Apple did mention the apps briefly, to say they have integrated Image Playgrounds, their AI image generation, into Pages, Keynote, and Numbers. With each major upgrade, the iWork apps usually get something. Office productivity isn’t exactly the center of innovation these days. The apps already do the things that 80% of users need.
serjester
a year ago
75% of OpenAI's revenue is coming from their consumer business - the better question is the long term viability of their public API.
But if they believe they're going to reach AGI, it makes no sense to pigeonhole themselves to the interface of ChatGPT. Seems like a pretty sensible decision to maintain both.
narmiouh
a year ago
Apparently it is predicted(1) that their API is a profit making business while chatgpt is a loss leader so far…
(1) https://www.tanayj.com/p/openai-and-anthropic-revenue-breakd...
8338550bff96
a year ago
75%? Thats astonishing to me. Where are you able to see those details?
It wouldn't surprise me if not a lot of enterprises are going through OpenAI's enterprise agreements - most already have a relationship with Microsoft in one capacity or another so going through Azure just seems like the lowest friction way to get access. If how many millions we spend on tokens through Azure to OpenAI is any indication of what other orgs are doing, I would expect consumer's $20/month to be a drop in the bucket.
jdgoesmarching
a year ago
This very good analysis estimates 73%, which includes team and enterprise. Given that enterprise access is limited and expensive, it seems Plus and Teams are mostly carrying this.
The whole financial breakdown is fascinating and I’m surprised to not see it circulating more.
8338550bff96
a year ago
I prefer analysis from industry experts, not PR execs moon-lighting as tech bloggers
jdgoesmarching
a year ago
By all means, go listen to the industry experts trying to sell you on a bubble before it pops
This analysis is just doing basic math based on reporting from the NYT and Post on OpenAI’s financials.
8338550bff96
a year ago
Your source is a blog post by a polemic author whose own source is second-hand by NYT, an organization that is in lawsuit with OpenAI. I would have rather have heard it from the horse's mouth. What financial information about OpenAI does NYT have that I don't? Do they have privileged access to private org financials?
In my estimation, you're not qualified for this conversation.
swarnie
a year ago
It may be pretty minimal but i can personally vouch for 20ish techies in my own social orbit who's businesses wont authorise or wont pay for OpenAI yet and are doing so out of their own pockets; i share an office with four of them.
Maybe the consumer side will slide as businesses pick up the tab?
8338550bff96
a year ago
I don't understand what paying for openai is meant to mean? You mean paying for tokens?
debbiedowner
a year ago
An LLM named Duet has been in Google docs for 17 months now! https://workspace.google.com/blog/product-announcements/duet...
I've been using it for about a year.
franze
a year ago
never figured out on how to activate it in my workspace
karamanolev
a year ago
Same here. I feel like Google's products have become such a labyrinth of features, settings, integrations, separate (but not really) products, that navigating them requires an expert. Sadly, I don't see a way back - each new additional feature or product is just bolted on top and adds more complexity. Given the corporate structure of Google, there's zero chance of an org-wide restructuring of the labyrinth.
herval
a year ago
google's approach to shipping products is puzzling. It's like they don't care if anyone uses them at all
Barrin92
a year ago
Google isn't a startup, they aren't desperate to impress anyone. I don't even think they consider "AI" to be a product, which is probably correct. These AI enabled features are background processes that ideally integrate into products over time in ways that don't require you to explicitly know they're even there.
Given how widely used Google Docs is, for serious work, disrupting people's workflows is not a good thing. Google has no problem being second, they aren't going to die in the next three months just because people on Twitter say so.
herval
a year ago
I think what you mean is "Google is complacent, so they don't think they need to make a lot of effort to stay relevant"
wenc
a year ago
> This also demonstrates the type of things Google could do with Gemini integrated into Google Docs if they step up their game a bit.
This is exactly what Google’s NotebookLM does. It’s (currently) free and it reads your Google Docs and does RAG on them.
HarHarVeryFunny
a year ago
The most amazing thing with notebooklm is that is can turn your docs into a very high quality podcast of two people discussing the content of your docs.
rty32
a year ago
It is a cool concept, but anyone who listens to enough podcasts know that hosts have personalities and interests, and productions usually have their styles, focus and quality. These features make podcast channels unique and make you want to come back. That's why you may want to listen to podcast A instead of B even though they discuss the same topics. I doubt the Google thing will ever give us that -- likely just one hour of generic rambling that gets boring.
supafastcoder
a year ago
It's fun the first time but it quickly gets boring.
8338550bff96
a year ago
Finding signal in noise is not an easy job given clip things are moving along. Whatever content creators need to do to deliver quality distilled content - I'm here for it.
theragra
a year ago
This feature is cool as fuck, but I noticed that podcasts it generates loose quite a lot of details from the original article. Even longreads turn into 13 mins chunks.
ben_w
a year ago
Juggling dog. It's not very good, but it's amazing that it's possible at all.
https://github.com/BenWheatley/Timeline-of-the-near-future
I've only used the "Deep Dive" generator a few times, and I'm already sensing the audio equivalent of "youtube face" in the style — not saying that's inherently bad, but this is definitely early days for this kind of tool, so consider Deep Dive as it is today to be a GPT-2 demo of things to come.
stavros
a year ago
Do you have a reference for the "Juggling dog" thing? I've heard it with "singing dog", but I never managed to find any "official" reference or explanation of the thing.
Jerrrrrrry
a year ago
He meant singing dog, likely conflated due to his linguistic interest.
"Juggling dog" has only been expressed a single time previously in our corpus of humanity:
During the Middle Ages, however, church and state sometimes frowned more sternly on the juggler. "The duties of the king," said the edicts of the Sixth Council of Paris during the Middle Ages, "are to prevent theft, to punish adultery, and to refuse to maintain jongleurs."(4) What did these jugglers do to provoke the ire of churchmen? It is difficult to say with certainty, since the jongleurs were often jacks-of-all-trades. At times they were auxiliary performers who worked with troubadour poets in Europe, especially the south of France and Spain. The troubadours would write poetry, and the jongleurs would perform their verses to music. But troubadours often performed their own poetry, and jongleurs chanted street ballads they had picked up in their wanderings. Consequently, the terms "troubadour" and "jongleur" are often used interchangeably by their contemporaries.
These jongleurs might sing amorous songs or pantomime licentious actions. But they might be also jugglers, bear trainers, acrobats, sleight-of-hand artists or outright mountebanks. Historian Joseph Anglade remarks that in the high Middle Ages:"We see the singer and strolling musician, who comes to the cabaret to perform; the mountebank-juggler, with his tricks of sleight-of-hand, who well represents the class of jongleurs for whom his name had become synonymous; and finally the acrobat, often accompanied by female dancers of easy morals, exhibiting to the gaping public the gaggle of animals he has dressed up — birds, monkeys, bears, savant dogs and counting cats — in a word, all the types found in fairs and circuses who come under the general name of jongleur.”(5) -- http://www.arthurchandler.com/symbolism-of-juggling
ben_w
a year ago
TIL about "jongleurs".
I suspect what I heard was a deliberate modification of this sexist quote from Samuel Johnson, which I only found by this thread piquing my curiosity: "Sir, a woman's preaching is like a dog's walking on his hind legs. It is not done well; but you are surprised to find it done at all." - https://www.goodreads.com/quotes/252983-sir-a-woman-s-preach...
Trying to find where I got my version from, takes me back to my own comments on Hacker News from 8 months ago, and I couldn't remember where I got it from then either:
> "your dog is juggling, filing taxes, and baking a cake, and rather than be impressed it can do any of those things, you're complaining it drops some balls, misses some figures, and the cake recipe leaves a lot to be desired". - https://news.ycombinator.com/item?id=39170057
My comment there predates this Mastodon thread, but the story in Mastodon may predate whoever told me the version I encountered: https://social.coop/@GuerillaOntologist/112598462146879765
SoftTalker
a year ago
It’s a great phrase all that aside. I’m adopting it.
cryptoz
a year ago
The confetti is out of the cannon!
Jerrrrrrry
a year ago
"Dogs were not aware of their shared interest in juggling until the invention of the internet, where like-minded canines would eventually congregate unto enclaves of specialty."
ben_w
a year ago
Unfortunately not.
Trying to find where I got my version from just brought me back to one of my own comments on Hacker News from 8 months ago:
> "your dog is juggling, filing taxes, and baking a cake, and rather than be impressed it can do any of those things, you're complaining it drops some balls, misses some figures, and the cake recipe leaves a lot to be desired". - https://news.ycombinator.com/item?id=39170057
I couldn't remember where I got it from then either.
stavros
a year ago
Oh hm, maybe you came up with it/adapted a previous saying, then. I'm not 100% sure the dog was singing either.
gwern
a year ago
He is adapting one of Samuel Johnson's most famous quotations, about the astonishing sight of seeing a woman preaching - like a dog walking, it may not be done well, but it's astonishing to see it done at all.
ren_engineer
a year ago
ChatGPT itself is them copying their own API users, this is just them building out more features already built by users. My guess is they know they don't have a long term edge in models alone, so they are going to rely on expanding ChatGPT for better margins and to keep getting training data from users. They obviously want to control the platform, not integrate with other platforms
kridsdale3
a year ago
The Amazon model.
Same as it ever was.
carom
a year ago
Their API is unusable due to rate limits. Myself and my wife have both had ideas, started using it, and found other approaches after hitting rate limits. I tried funding more money in the account to increase the rate limits and it did not work. I imagine they see poor growth there because of this.
bearjaws
a year ago
It's pretty trivial to get increased limits, I've used the API for a few consulting projects and got to tier 4 in a month. At that point you can burn near $200 a day and 2 million tokens per minute.
You only need 45 days to get tier 5 and if you have that many customers after 45 days you should just apply to YC lol.
Maybe you checked over a year ago, which was the wild wild West at the time, they didn't even have the tier limits.
epolanski
a year ago
> and if you have that many customers after 45 days you should just apply to YC lol.
What for? If someone has already a business and customers he's already far off the average YC startup.
kridsdale3
a year ago
Who doesn't like free money and marketing?
epolanski
a year ago
It's not free money, you give up on a sizeable amount of equity (7%) you could also sell for more than half a million.
It depends really on the business and what not.
cryptoz
a year ago
150,000,000 tokens per minute and 30,000 requests per minute is unusable?! Maybe that’s just Tier 5 but the API is most definitely not unusable.
I’m firmly in the camp that their rate limits are entirely reasonable.
byearthithatius
a year ago
You need to use it for some time to get into their higher tiers of usage. I used to also have this problem and it annoyed me greatly, but once I got to usage tier 4 it never happened again (except for o1-preview but that just wastes tokens IMO).
w0m
a year ago
If I'm reading this right; it's been in VSCode as Copilot Chat for a fair bit now. I use it often, when they added context (provide extra files to reference or even the entire @workspace if it's small enough), absolute gamechanger.
herval
a year ago
LLM as a service is much easier to replicate than physical data centers and there's a much lower potential user base than consumers, so I'd imagine they're swimming upstream into B2C land in order to justify the valuation
truetraveller
a year ago
You mean downstream, not upstream. Upstream is closer to the raw materials.
herval
a year ago
the bike shed is blue
qingcharles
a year ago
Aren't we talking about, say, GitHub Copilot? That's integrated into Visual Studio/VSCode. I just started using it again as they've done some small upgrades, and the results can often be phenomenal. Like, I will visualize an entire block of code in my mind, and I'll type the first couple of characters and the entire block will just appear. I'm literally that predictable.
Copilot is only using GPT3.5 for most of the results though, seemingly. I'd be more excited if they would update the API they're using.
leetharris
a year ago
> Honestly I’m scratching my head on OpenAI’s desire to double down on building out their consumer B2C use cases rather than truly focussing on being the infrastructure/API provider for other services to plug into
I think it's because LLMs (and to some extent other modalities) tend to be "winner takes all." OpenAI doesn't have a long term moat, their data and architecture is not wildly better than xAI, Google, MS, Meta, etc.
If they don't secure their position as #1 Chatbot I think they will eventually become #2, then #3, etc.
aflukasz
a year ago
> If they don't secure their position as #1 Chatbot I think they will eventually become #2, then #3, etc.
But can they do it at all? It's not like they are like early Google vs other search engines.
ben_w
a year ago
At the moment this feels like a x10 speed run on the browser wars: lots of competitors very quickly churning who is "best" according to some metric, stuff getting baked into operating systems, freely licensed models.
How do you make money off a web browser, to justify the development costs? And what does that look like in an LLM?
visarga
a year ago
LLMs are a more flexible platform than browsers. They can be prompted, finetuned or run locally. Even if a company wants to make their base model spit ads, it won't fly.
ben_w
a year ago
Depends how subtle they are about it, and what the rest of the ecosystem looks like.
Perhaps the ad/ad-blocker analogy would be: You can have the free genuinely open source LLM trained only on Wikipedia and out-of-copyright materials, or you can have one trained on current NYT articles and Elsevier publications that also subtly pushes you towards specific brand names or political parties that paid to sponsor the model.
Also consider SEO: every business wants to do that, nobody wants to use a search engine where the SEO teams won. We're already seeing people try to do SEO-type things to LLMs.
If (when) the advertisers "win" and some model is spitting out "Buy Acme TNT, for all your roadrunner-hunting needs! Special discount for coyotes!" on every other line, then I'd agree with you, it won't fly, people will switch. But it doesn't need to start quite so bold, the first steps on this path are already being attempted by marketers attempting to induce LLMs crawling their content to say more good things about their own stuff. I hope they fail, but I expect them to keep trying until they succeed.
kridsdale3
a year ago
I believe you've nailed it.
Google and Facebook grew organically for a number of years before really opening the tap on ad intrusions in to the UX. Once they did, a tsunami of money crashed over both, quarterly.
The LLM companies will have this moment too.
(But your post makes me want to put a negative-prompt for Elsevier publications in to my Custom Instructions, just in case)
visarga
a year ago
There is huge choice in open models. People won't adopt one with ads baked in, unlike Google and Facebook, because now there are more options. There are 100K LLM finetunes on HuggingFace.
ben_w
a year ago
I've got some of them on my experimentation laptop. They're only good enough to be interesting, not good in comparison to the private models, and the number of fine-tunes doesn't help with that. In particular I've had Microsoft's Phi 3.5 for less than a week and yet I've already had at least 4 cases of it spouting wild nonsense unrelated to the prompt — and I don't even mean that it was simply wrong, I mean the response started off with Chinese and then acted like it was the early GPT-3 "Ada" model doing autocomplete.
One of my machines also has a copy of Firefox on it. Not used that in ages, either. But Firefox is closer in quality to Chrome, than any of the locally-runnable LLMs I've tried are to the private/hosted LLMs like 4o.
bastawhiz
a year ago
I suspect they are building their B2C products because it gives them better data to train on. It's a lot harder to control the quality of data when you have no idea how API inputs were produced, what the UI is like, or who the users are. You don't know the provenance of the data, or the context. Or even if multiple unrelated client products are being commingled through the same key.
If you control the UI, you have none of those problems.
JumpCrisscross
a year ago
> demonstrates the type of things Google could do with Gemini integrated into Google Docs
Or Microsoft!
> think OpenAI will end up being either an infrastructure provider OR a SaaS, but not both
Microsoft cut off OpenAI's ability to execute on the former by making Azure their exclusive cloud partner. Being an infrastructure provider with zero metal is doable, but it leaves obvious room for a competitor to optimise.
munchler
a year ago
Microsoft is integrating Copilot into many of their products, including Visual Studio and Office/365.
joseda-hg
a year ago
VSCode Sure, but my experience with Copilot + regular Visual Studio has been nothing short of abismal
rty32
a year ago
Eh, GitHub CoPilot? Microsoft is literally THE company that understands developers' workflows and creates good IDEs.
rising-sky
a year ago
> but I wish it were integrated into tools already used for coding and writing rather than having it be a separate app
Take a look at cursor.com
fakedang
a year ago
Cursor is a funny company. They were invested into by OpenAI, but almost everyone using Cursor uses it with Claude Sonnet 3.5.
no_wizard
a year ago
To be honest I think they’re having less success than it appears with their B2B offerings. A lot of cloud providers services like AWS have their own things they sell through those channels and I think a lot of businesses are finding those solutions to be cheaper and “good enough”
1659447091
a year ago
> but I wish it were integrated into tools already used for coding
Unless I'm missing something about Canvas, gh CoPilot Chat (which is basically ChatGPT?) integrates inline into IntelliJ. Start a chat from line numbers and it provides a diff before applying or refining.
gnatolf
a year ago
> which is basically ChatGPT?
Yea, I'm wondering the same. Is there any good resource to look up whether copilot follows the ChatGPT updates? I would be renewing my subscription, but it does not feel like it has improved similarly to how the new models have...
1659447091
a year ago
I check the GitHub blog[0] from time to time. They also have a RSS feed if you'd prefer that. The is also a waitlist for o1 access you may sign up for[1]
[0] https://github.blog/changelog/label/copilot/
[1] https://github.blog/changelog/2024-09-19-sign-up-for-openai-...
maestrae
a year ago
According to this (1), they are using the 4o model. And looks like you'll be able to pick your model(2) in the starting with version 1.94 released this September.
1.https://code.visualstudio.com/updates/v1_92#_github-copilot
2.https://code.visualstudio.com/updates/v1_94#_switch-language...
ForHackernews
a year ago
I think this is already built into Microsoft's Office365 "CoPilot" (which I assume is a ChatGPT frontend. You can ask the AI to make changes to your Office documents.
epolanski
a year ago
Being just a service provider makes you easy to replace with other service providers.
Professionals instead don't love to change the tools once they got used to it for small incremental gains.
zmgsabst
a year ago
But my subscription at $20/mo is a fraction of my API usage at $5/day (about $100/mo).
You can sell a lot more GPT services through a higher bandwidth channel — and OpenAI doesn’t give me a way to reach the same bandwidth through their user interface.
isignal
a year ago
Consumer side can allow you to run ads and get Google like revenue in the future.
briandear
a year ago
Not sure how or why you’d want this integrated into Vim for instance.
ygjb
a year ago
idk, I can definitely see value in a lightweight LLM component for VIM to help me look up the correct command sequence to exit :P
kridsdale3
a year ago
HEY SIRI HOW DO I GET THE FUCK OUT OF VI
riffraff
a year ago
google has gemini integrated in Google Colab (jupyter notebooks) and while it doesn't work 100% well, it's a pretty great idea.
mark_l_watson
a year ago
I only use Gemini in Colab perhaps 5% of the times I use Colab, yet it is nice to have.
I use Gemini, OpenAI, Claude, smaller models in Grok, and run small models locally using Ollama. I am getting to the point where I am thinking I would be better off choosing one (or two.)
mmaunder
a year ago
Have you used Canvas?