Show HN: Watch 3 AIs compete in real-time stock trading

270 points by sunnynagra 10 days ago | 202 comments

A live dashboard where you can watch GPT-4, Claude 3, and Gemini analyze market data and make daily stock trades with real money. Each AI explains its reasoning, and you can compare their different approaches to the same data.

Link: https://trading.snagra.com?utm_source=hn (no signup required)

What you can try right now:
- Watch live trades from GPT-4, Claude 3, and Gemini
- Read each AI's full analysis and reasoning
- Compare their different interpretations of the same market data
- Track their real-time performance and win rates
- View historical trades and performance metrics

Built this over the holidays to study how different AI models approach financial decisions. Each morning at 9:30 AM EST, the AIs analyze market data and make real trades with $5 stakes.

Technical Implementation:
- Next.js frontend with real-time updates
- Node.js/Lambda backend for AI processing
- PostgreSQL for trade tracking
- Alpaca API for automated trading
- Consistent prompts for all models

Data Flow:
1. Daily market analysis (9:30 AM EST)
2. Each AI gets identical inputs:
   - Financial headlines
   - Market summaries
   - Technical indicators
   - Earnings reports
3. AIs provide:
   - Stock picks with reasoning
   - Entry/exit conditions
   - Risk assessment
4. Automated trade execution

Note: This is an experiment in AI behavior, not investment advice. The goal is to study how different LLMs interpret financial data and make decisions with real consequences.

I'll be around to answer questions about the implementation.

pavel_lishin 10 days ago |
Where do they get the market news from?
sunnynagra 10 days ago |
The most recent 50 news articles are pulled via this API: https://docs.alpaca.markets/reference/news-3
datadrivenangel 10 days ago |
This is fun! What kind of prompts / prompting techniques are you using?
sunnynagra 10 days ago |
Thanks! I use several key prompting techniques:
1. Role + Goal Setting: The AI acts as a creative market analyst focused on discovering overlooked opportunities and emerging trends.
2. Structured Analysis Framework:
   - Detailed evaluation criteria (innovation, moat, management, growth potential)
   - Sector diversity requirements
   - Focus on finding hidden gems vs obvious mega-cap tech stocks
3. Time-Bound Precision: Instead of vague "3-6 months" holding periods, I require exact hour calculations tied to specific catalysts like:
   - FDA approval dates
   - Earnings releases
   - Product launches
   - Conference presentations
4. Quality Controls:
   - Must be valid NYSE/NASDAQ symbols
   - Diverse across sectors/market caps
   - Conviction level scoring (1-10)
   - Each pick needs unique thesis + catalyst
   - JSON output format for consistency
The key is combining structured analysis with creative discovery - pushing the AI to look beyond obvious choices while maintaining some analytical rigor.
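The quality controls listed above could be checked mechanically on each model's JSON reply. This is only a sketch under assumptions: the field names (`symbol`, `conviction`, `thesis`, `catalyst`, `holding_hours`) and the `validate_pick` helper are hypothetical, since the actual schema is not shown in the thread.

```python
import json
import re

# Hypothetical field names; the real prompt's JSON schema is not shown in the thread.
REQUIRED_FIELDS = ("symbol", "conviction", "thesis", "catalyst", "holding_hours")


def validate_pick(raw_json, valid_symbols):
    """Return a list of problems found in one model's JSON pick (empty if it passes)."""
    try:
        pick = json.loads(raw_json)
    except json.JSONDecodeError:
        return ["not valid JSON"]
    if not isinstance(pick, dict):
        return ["top-level value is not an object"]

    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if name not in pick]
    if problems:
        return problems

    # Simplification: only plain 1-5 letter tickers are accepted (no class suffixes).
    symbol = pick["symbol"]
    if not isinstance(symbol, str) or not re.fullmatch(r"[A-Z]{1,5}", symbol):
        problems.append("symbol is not a plain 1-5 letter ticker")
    elif symbol not in valid_symbols:
        problems.append("symbol is not in the allowed NYSE/NASDAQ list")

    conviction = pick["conviction"]
    if isinstance(conviction, bool) or not isinstance(conviction, int) or not 1 <= conviction <= 10:
        problems.append("conviction must be an integer from 1 to 10")

    hours = pick["holding_hours"]
    if isinstance(hours, bool) or not isinstance(hours, (int, float)) or hours <= 0:
        problems.append("holding_hours must be a positive number")

    return problems
```

A reply that fails any check could be rejected or retried before it reaches trade execution; the sector-diversity rule spans several picks, so it would need a separate check that this sketch leaves out.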
thevilledev 10 days ago |
What’s the investment horizon for these daily decisions? Does it have a maximum hold time? How long will you run the experiment and is it enough to cover all the catalysts that are expected?
sunnynagra 10 days ago |
I don't have a hard-set maximum hold date, but I'm planning to keep the buys running for at least a year. I will re-evaluate regularly to see if it is still useful to keep it up and running.
datadrivenangel 10 days ago |
Makes sense. Any thoughts on expanding scope to have multiple 'analyst' roles per LLM model? Could be interesting to see if changing roles/prompts yields better results.
tedd4u 10 days ago |
Sunny, given this investment objective, what would you consider a good (and transparent) benchmark? Thanks for sharing this.
jesprenj 10 days ago |
Right now they are just buying, no one is selling ... interesting.
jerkstate 10 days ago |
I would guess that LLMs are biased towards making a positive assessment of ambiguous information, with specific social triggers prompting negative reaction.
normie3000 10 days ago |
Also it's hard to sell before buying, and it looks like it's only been going 2 days.
dotancohen 10 days ago |
Can the AIs short?
chollida1 10 days ago |
> Also it's hard to sell before buying, and it looks like it's only been going 2 days.
It is not, that's called shorting and it's very common.
In fact a lot of strategies that are market neutral work by shorting one stock while being long the other, or similarly a basket of stocks.
sunnynagra 10 days ago |
Yeah, this is only the second day of trading
whoiscroberts 10 days ago |
You have to buy before you sell
Joel_Mckay 10 days ago |
Warren Buffett always said "...the best thing to do is buy a stock that you don't ever want to sell", but practically speaking the mean hold time for amateurs is around 2 to 4 months.
I just recall Navinder Singh Sarao "$1T Flash Crash" as a notable addition to a long list of algorithmic trading strategies going sideways ( https://marketrealist.com/who-is-navinder-singh-sarao-the-ma... .)
The stock market was built on information asymmetry, unfair positions, and ambitious gamblers... statistically it is rarely a reasonable investment for amateurs.
Good luck, =3
asdefghyk 10 days ago |
What could go wrong?
dotancohen 10 days ago |
Lose $5. Seems like a reasonable enough experiment.
jeffadelic 10 days ago |
$5 * 3 models per day=$15 a day
Assume the experiment runs ~250 trading days in a year, consider the worst case they lose all their invested money=$3750.
A little more than $5 :)
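The arithmetic above, as a quick sketch. It assumes exactly one $5 trade per model per trading day and a total loss on every trade; the function name is made up for illustration.

```python
def worst_case_annual_loss(stake_per_trade, models, trading_days):
    """Total amount lost if every trade loses its entire stake."""
    return stake_per_trade * models * trading_days


daily_outlay = 5 * 3  # $5 per trade, three models
print(daily_outlay)  # 15
print(worst_case_annual_loss(5, 3, 250))  # 3750
```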
dotancohen 9 days ago |
Good point.
That said, many hobbies cost more than $3750 per year, and that $3750 is a worst-case scenario. He might even make a profit, and hone skills that might make him a fortune.
unsupp0rted 10 days ago |
> Best Performer
> AIs are tied
Sounds about right
sunnynagra 10 days ago |
None of the stocks have been sold yet, this is just day 2, so once some sales happen, then performance will be better measured. If you scroll down, you can see the unrealized performance.
aredox 10 days ago |
They should have added a pure random bot as a control.
Or a monkey.
rozap 10 days ago |
Or FISH.
https://youtu.be/USKD3vPD6ZA?si=AGyGdPdSPpJezQJp
The scene towards the end where he pitches it to a bunch of hucksters is brilliant.
lewj 10 days ago |
Or just the S&P500 or something similar that acts as a default "if in doubt, chuck into here for relative safety".
sunnynagra 10 days ago |
Another good suggestion I could implement is measuring against something like VOO, if all the money was invested in that instead of these individual trades.
yapyap 10 days ago |
> Or a monkey.
or just a stocktrader haha
byyoung3 10 days ago |
lol
chronic026935 10 days ago |
> or just a stocktrader haha
Many quant trading firms make 50%-100% annual returns, each year, over the past 15-20 years. The secret is leverage. And they do not accept outside investor money.
Many hedge funds outperform the market. However, the returns after fees, to the passive outside investor underperform S&P500.
But yes, publicly traded active ETFs generally underperform. But counter example is VGT or QQQ, both historically outperformed S&P500.
maximus-decimus 10 days ago |
The problem with looking at which funds over-perform is they just close the funds that under-perform so all the existing ones over-perform... by the sheer power of survivorship bias.
WalterBright 10 days ago |
Past performance is no predictor of future returns.
chronic73940581 10 days ago |
> Past performance is no predictor of future returns
False. Why do people invest in real estate and S&P500 passive index funds?
Because historically they go up.
WalterBright 9 days ago |
That's of no predictive value for a day, a month, or even years.
WalterBright 9 days ago |
BTW, with the birth rates dropping well below replacement, a decline in the population is inevitable, and property values will drop.
maximus-decimus 9 days ago |
That's assuming you don't fill the gap with immigration.
Scoundreller 10 days ago |
Wouldn’t it be fairer to compare against a leveraged ETF?
TQQQ (3x daily return leveraged nasdaq 100) is up 180x since its well-timed inception in 2010.
Though that’s a bit over 40% annually.
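The conversion from a total multiple to an annual rate can be sketched like this. The 15-year span is my assumption (2010 inception to roughly when this thread was posted), and the function name is hypothetical.

```python
def annualized_return(total_multiple, years):
    """Compound annual growth rate implied by a total return multiple."""
    return total_multiple ** (1 / years) - 1


# 180x over an assumed ~15 years comes out a bit over 40% per year.
rate = annualized_return(180, 15)
print(f"{rate:.1%}")
```

Shifting the assumed span by a year moves the result by a few percentage points, so the "bit over 40%" figure is only as good as the inception date used.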
Galanwe 10 days ago |
> Wouldn’t it be fairer to compare against a leveraged ETF?
No, it's actually the reverse. You have to compare at equal annual vol, and the S&P already has something like 20%. Most HF operate around 10% on AUM.
chronic73940581 10 days ago |
> No, it's actually the reverse. You have to compare at equal annual vol, and the S&P already has something like 20%.
Stop thinking like a hedge fund.
TQQQ commonly is used as a benchmark because it represents a low-friction, practical alternative to VTI, VOO, and even private equity investments including hedge funds trading public securities.
Once your Sharpe is high enough, you stop caring about volatility. The only volatility is how many zeros in your almost-always positive PnL.
Hedge funds (and traditional asset managers) care about drawdown, vol, sortino, beta and all that shit. But hedge funds have a different business model than prop trading firms.
Galanwe 10 days ago |
> Many quant trading firms make 50%-100% annual returns. The secret is leverage
Hu lol no XD you're way over stating it. While it happens _sometimes_, 50% or 100% is insanely rare, even for the top tier hedge funds.
Most HF work at predefined annual volatility, often in the 7% to 10% range. A typical _top tier_ sharpe is in the >=2 range, we're more talking about a 10%/25% averaged annual returns.
> However, the returns after fees, to the passive outside investor underperform S&P500.
That doesn't even make sense with the figures you posted. Most HF operate under the 2:20 or 3:30 range, sometimes 0:40 for the top 5. If you take a pessimist 10% returns on 10% annual vol, against the S&P 10% averaged returns at 20% vol, you're still double the risk adjusted returns, gross. Factor in 20 to 40% performance fees and you're way above the S&P.
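A rough sketch of the comparison being made here: return per unit of volatility, gross and after a performance fee. It deliberately ignores the risk-free rate, management fees, and any effect of fees on volatility, so it is a simplification rather than a proper Sharpe calculation. Function names are hypothetical.

```python
def return_per_unit_vol(annual_return, annual_vol):
    """Crude risk-adjusted return: annual return divided by annual volatility."""
    return annual_return / annual_vol


def after_performance_fee(gross_return, performance_fee):
    """Return left for the investor once the performance fee is taken from gains."""
    return gross_return * (1 - performance_fee)


sp500 = return_per_unit_vol(0.10, 0.20)       # 10% return at 20% vol
fund_gross = return_per_unit_vol(0.10, 0.10)  # 10% return at 10% vol
fund_net_20 = return_per_unit_vol(after_performance_fee(0.10, 0.20), 0.10)
fund_net_40 = return_per_unit_vol(after_performance_fee(0.10, 0.40), 0.10)

print(f"S&P 500:           {sp500:.2f}")
print(f"fund, gross:       {fund_gross:.2f}")
print(f"fund, net 20% fee: {fund_net_20:.2f}")
print(f"fund, net 40% fee: {fund_net_40:.2f}")
```

With these inputs the fund's ratio stays above the index even at the 40% fee level, which is the point the comment is making.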
mhh__ 10 days ago |
I think this almost always refers to Renaissance, except that they aren't really a hedge fund the same way (say) Millennium is
chronic73940581 10 days ago |
> A typical _top tier_ sharpe is in the >=2 range, we're more talking about a 10%/25% averaged annual returns.
High-frequency low latency trading: Sharpe 10 or higher
Mid-frequency low latency trading: sharpe 4 to 5
Hedge fund statistical arbitrage: sharpe 1 to 2
Hedge fund long/short, event driven, global macro, etc: sharpe 0 to 1
And yes, HFT and MFT scales to billions in annual PnL for single firms.
There’s a reason quant HFT firms pay the most, and are ranked above OpenAI in pay and prestige. Hedge funds are tier 2 in comparison but not bad either.
mhh__ 10 days ago |
They also often don't compound so you might actually make significantly less
rufus_foreman 10 days ago |
>> Many quant trading firms make 50%-100% annual returns, each year, over the past 15-20 years
100% annual returns on 1 million dollars for 20 years is 1 trillion dollars. No one is making that type of return.
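The compounding claim checks out, assuming every year's gains are fully reinvested. A minimal sketch (the function name is made up):

```python
def compound(principal, annual_rate, years):
    """Value of principal after compounding annual_rate for the given number of years."""
    return principal * (1 + annual_rate) ** years


# Doubling $1M every year for 20 years: 2**20 is about 1.05 million.
print(compound(1_000_000, 1.00, 20))  # 1048576000000.0
```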
WalterBright 10 days ago |
Why it's worth paying attention in math class.
chronic73940581 10 days ago |
> Why it’s worth paying attention in math class.
Math class does not teach practical knowledge such as personal finance or health.
Citadel returns since 1990 is 38% annual returns before fees to outside investors. They have a 5:50 fee structure. There are hundreds of more firms, staying out of the public eye.
https://www.barrons.com/articles/multistrategy-hedge-funds-p...
Minimum investment $5M. Sorry but the middle class is not allowed.
WalterBright 9 days ago |
You don't need to know anything about finance or health to know how percentages and compounding work.
Besides, I knew nothing about construction when I discovered that the contractor I hired to pour a patio was overcharging me by 30%. All it took was a bit of geometry I learned in grade school.
Pay no attention to math in school and you'll be prey to every scammer who did, and you'll never realize it.
affyboi 8 days ago |
> Math class does not teach practical knowledge such as personal finance or health.
It teaches you how to work in a quant shop
dgacmu 10 days ago |
However, the medallion fund has averaged 66% for 30 years before fees. Analyzed naively, that would be $4T from $1M - but it's not, because in order to keep it working, they have to cap the size. Many strategies only work when you don't affect the market too much. So for the rare continually successful, market beating funds, it's probably better to think of them as generating something like a fixed dollar return per year. So they have a very effective money machine, but it's minting billions, not trillions.
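To make the contrast concrete, here is a sketch of naive compounding versus a size-capped fund that pays out its gains each year. The $10B cap is a made-up illustrative figure, not a number from this thread, and both function names are hypothetical.

```python
def compounded_value(principal, annual_rate, years):
    """Value if every year's gains are reinvested with no size cap."""
    return principal * (1 + annual_rate) ** years


def capped_total_profit(capped_capital, annual_rate, years):
    """Total profit paid out if capital is held at a fixed cap and gains are withdrawn."""
    return capped_capital * annual_rate * years


# Naive case: $1M at 66% for 30 years lands at roughly $4T.
naive = compounded_value(1_000_000, 0.66, 30)
print(f"naive compounding: ${naive / 1e12:.1f}T")

# Capped case: hypothetical $10B of capital, gains withdrawn every year.
capped = capped_total_profit(10_000_000_000, 0.66, 30)
print(f"capped fund, gains withdrawn: ${capped / 1e9:.0f}B")
```

The capped version grows linearly rather than exponentially, which is why the result is counted in billions.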
chronic73940581 10 days ago |
> No one is making that type of return.
Classic passive ETF Boglehead mindset.
Who said anything about re-investing? There are also significant tax considerations (loopholes) that encourage cashing out annually.
wqaatwt 7 days ago |
Since when is QQQ actively managed?
SubiculumCode 10 days ago |
You definitely need several active controls: 1. A broad mutual fund level buy and hodl. 2. The random buyer that you suggest.
Active controls (vs passive ones) are an important concept in experimental design.
kyleblarson 10 days ago |
Jim Cramer
fredzel 10 days ago |
Or a certain streamer AI
wodderam 10 days ago |
You would need something like 1000 instances of each LLM putting on trades and have a 1000 random walks to judge an average sharpe ratio or something along those lines.
As is, this means absolutely nothing and shows a misunderstanding of the problem.
Adding a random walk to this would mean you have 4 random walks instead of 3.
There is also the problem that it is tough to make a prediction for tomorrow that is better than today's close.
omoikane 10 days ago |
> a pure random bot
Maybe compare with this guy:
https://news.ycombinator.com/item?id=14713997 - Amazon engineer will let strangers manage his $50,000 stock portfolio 'forever' (2017-07-06, 172 comments)
alberth 10 days ago |
Or just compare it to S&P 500 performance.
affyboi 8 days ago |
You can just compute Sharpe
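For reference, a minimal sketch of an annualized Sharpe ratio from daily returns. The 252 trading days per year and the zero default risk-free rate are assumptions, and the sample returns in the usage lines are made up.

```python
import math
import statistics


def sharpe_ratio(daily_returns, risk_free_daily=0.0, periods_per_year=252):
    """Annualized Sharpe ratio from a list of daily returns (needs at least two values)."""
    excess = [r - risk_free_daily for r in daily_returns]
    spread = statistics.stdev(excess)  # sample standard deviation
    if spread == 0:
        raise ValueError("returns have zero volatility; Sharpe is undefined")
    return statistics.mean(excess) / spread * math.sqrt(periods_per_year)


# Made-up daily returns, for illustration only.
sample = [0.010, -0.005, 0.002, 0.007]
print(round(sharpe_ratio(sample), 2))
```

With only a handful of days of data the estimate is extremely noisy, so it would take a long track record before the number says much.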
dotancohen 10 days ago |
> Best Performer: AIs are tied
> Total Profit: $0.00
sunnynagra 10 days ago |
No stocks have been sold yet, so no profit/loss has been calculated, if you look below, you can see the unrealized gains for stocks being held.
dotancohen 10 days ago |
I see, thank you. Can they short?
sunnynagra 10 days ago |
No, trying for simple buys and sells first and getting that to work well before getting into other trading strategies.
dotancohen 10 days ago |
Great. Thank you for sharing!
HenryBemis 10 days ago |
I assume that shorting an asset you don't have may incur extra costs in some brokers. That would skew the results because a Buy would have X fees and a Sell/short would have 5X fees. So on an equal distance/pips movement the Buys would always be more profitable.
jingojango2 10 days ago |
What is meant by 5 dollar stakes? The bought shares reach triple digits in price.
sunnynagra 10 days ago |
Each morning the trades are conducted with $5 each, which are mostly fractional shares that are bought.
jingojango2 10 days ago |
Would be interesting to see the amount of fractional shares bought as well as its comparison in percentage to the total budget that day.
geor9e 10 days ago |
The fractional share is $5 divided by the share price. The bots each spend $5, so the percentage of the budget each spends is 1 divided by the arbitrary number of bots, so in this case 16.7%. Share price is an arbitrary value in that a company can split or reverse split at will. So both calculations would be arbitrary values.
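Both calculations as a sketch. The $250 share price is a made-up example, and the six trades per day is only inferred from the 16.7% figure above (1/6), not stated anywhere in the thread.

```python
def fractional_shares(stake, share_price):
    """Number of (fractional) shares a fixed dollar stake buys."""
    return stake / share_price


def share_of_daily_budget(stake, trades_per_day):
    """Fraction of the day's total budget taken by one equal-sized trade."""
    return stake / (stake * trades_per_day)


print(fractional_shares(5, 250))                # 0.02 shares of a $250 stock
print(f"{share_of_daily_budget(5, 6):.1%}")     # 16.7% with six equal trades
```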
sunnynagra 10 days ago |
Great point, I will add that to the recent trades table at the bottom. It should use the total budget for the day.
chongli 10 days ago |
You mean they add $5 in cash to each AI’s account? Because after dividends and sold shares they should have even more cash to work with.
BadHumans 10 days ago |
Can I let Claude do all my trading for me? It currently sits at 77% unrealized gains.
lewj 10 days ago |
I am committed - added to my daily morning reading list! Will be interesting - my gut will state that it will outperform a fair number of ITF's, if only due to the inevitable usage by said funds!
rixed 10 days ago |
> The goal is to study how different LLMs interpret financial data and make decisions with real consequences.
I don't really buy this. If the goal was to study how different LLMs interpret financial data there would be no use for actual trades, since their interpretation cannot be influenced by the fact that the trading orders are passed for real.
I believe the goal is to see if AI can do better than rats [0]. There is no shame in that.
[0]: https://www.vice.com/en/article/rattraders-0000519-v21n12/
echoangle 10 days ago |
> If the goal was to study how different LLMs interpret financial data there would be no use for actual trades, since their interpretation cannot be influenced by the fact that the trading orders are passed for real.
Technically every trade influences the stock, but I agree that it won't have any effect at all.
> I believe the goal is to see if AI can do better than rats [0]. There is no shame in that.
But even then you wouldn't have to perform real trades, you could still just calculate the profit as if trades would have happened.
I think the actual trading is just to make it more interesting.
mh- 10 days ago |
> you could still just calculate the profit as if trades would have happened
Depending on the type of trades, the volume of the equities, etc.. it can be very difficult to simulate the ability to open/close positions with sufficient accuracy to evaluate the strategies.
Retr0id 10 days ago |
Real trades have transaction fees, latency, slippage, etc. - you can simulate all this, but it's hard to know if it's being simulated correctly or not.
> their interpretation cannot be influenced by the fact that the trading orders are passed for real
It's not going to make much difference with $5 trades, but the impact on the market is non-zero.
vasco 10 days ago |
It's zero for all practical purposes and it'd be completely undetectable to every single system on earth. I do agree many times studies about model performance break down as soon as you force the researcher to actually connect it to the market and have to eat fees and so on.
Retr0id 10 days ago |
For the trades it's currently doing, sure, but if it for some reason decided to go after low-volume penny stocks it might start to be measurable.
WalterBright 10 days ago |
> fees, latency, slippage
Whenever I trade, I somehow always get an adverse price. I figure it's the "no fee" brokerage chiseling a bit off for themselves. I compensate by being a buy and hold hold hold investor, so paying very little in aggregate for that.
What I don't understand is how day traders avoid being eaten alive by this.
nilamo 10 days ago |
"Free" transactions are free because they're not immediate. The broker buys the share themselves and sells it to you at markup... ie: there is still a transaction fee, you just have no idea what it is.
Day traders use platforms that are optimized for speed and minimal fees, and that don't charge fees based on lot size.
SnorkelTan 10 days ago |
What you're suggesting is front running. This is illegal for stocks and most assets (not FX!). This will get a broker in hot water.
The more nuanced practice that brokers use to monetize is payment for order flow. They sell your security order flow to algorithmic trading shops that buy and sell the securities you want to trade.
You’re correct in that most retail orders never make it to a regulated exchange, but that may not always be a bad thing. There’s been studies showing that HFTs often match retail trades even when the market moves against them since they are better able to predict market changes and can still profit off the trades.
internet101010 9 days ago |
Right. They sell the order flow to the dark pool who then front runs the order. I haven't looked at this since like 2018 but last I checked the only major brokerage that didn't sell order flow was Interactive Brokers.
gpderetta 9 days ago |
They still have to guarantee best execution.
SnorkelTan 8 days ago |
Is it execution or price? Iirc the broker cannot give you a worse price if it knows of a better one... But is the regulation that the price must actually result in an executed trade?
affyboi 8 days ago |
Generally speaking more volume is good. I’m happy I can buy/sell most of my stocks instantly and that I don’t pay execution fees. I don’t think most average traders operate on a horizon/scale that’s directly competing with institutional funds.
affyboi 8 days ago |
There’s no markup, regulations dictate that you must get NBBO or better
jjxw 10 days ago |
Turns out most day traders are eaten alive. There's one study a few years ago that looked at Brazilian day traders and found 97% of traders that traded for more than 300 days were unprofitable [1]. I imagine this is due to a combination of factors which include 1) no real edge against the market and 2) fees. Of course unclear if their results generalize to other equity markets, but I think this is some evidence that the average day trader will have a difficult time beating the more sophisticated market participants over a large sample.
[1] https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3423101
justinsaccount 10 days ago |
Do you know the difference between a limit order and a market order?
WalterBright 9 days ago |
Yes, and it's irrelevant to my point.
justinsaccount 9 days ago |
Explain how you can get filled on a limit order and "get an adverse price"
ErikBjare 4 days ago |
You can only get an adverse price on a market order
sunnynagra 10 days ago |
You make fair points. Having them do actual trades is mostly to make it more personally fun and interesting to myself.
chrishare 10 days ago |
Looks great, well done
noman-land 10 days ago |
Watch a random number generator generate random numbers.
sunnynagra 10 days ago |
Yeah, I don't expect anything super novel to come out of this or have any unrealistic expectations. This is mostly a fun and unscientific project I'm using to learn and build some skills and thought some HN folks would find some fun in it.
bee_rider 10 days ago |
It is a cool project, IMO. Using real money, sharing the model reasoning, and being transparent about the implementation makes it more interesting even if the underlying amount of money is not massive. You might not have done some new science, but it’s all very “put up or shut up,” haha, which is rad.
NathanaelRea 10 days ago |
If they just get the financial headlines and indicators, aren't they all just momentum trading from sentiment analysis?
knallfrosch 10 days ago |
Is anyone doing anything else?
booleandilemma 10 days ago |
I've heard Nancy Pelosi has a different strategy.
datadrivenangel 10 days ago |
Advanced notice of momentum is a fun and lucrative variation for sure.
BLKNSLVR 10 days ago |
Would it be possible for a competing nation state to bug the right rooms in which Nancy becomes privy to the information she (or her husband) trades on?
xsmasher 8 days ago |
This gave me a funny idea - play continuous audio of AIs talking to each other in all unused conference rooms so the opposition has to filter through even more garbage to get the useful information.
jfengel 10 days ago |
Some alternatives:
* Buy and hold
* Index funds
* Dollar cost averaging
stouset 10 days ago |
Those can even all be the same alternative.
PaulRobinson 10 days ago |
Yes.
This is not necessarily a poor value trading strategy.
sunnynagra 10 days ago |
I think this is a fair characterization. It's mostly meant to be a learning exercise for myself, just thought it would be fun to share.
jfengel 10 days ago |
If they can read and act faster, accurately predicting sentiment, it would be a winning strategy. (At least until humans turned it all over to computers and stopped having to wait on their wetware to figure out their sentiments.)
ratedgene 10 days ago |
It would be neat to see the process, where they get the data from, how they analyze it.
It would be neat to also see another experiment of a MAS doing this and coordinating to gamble together. Perhaps even different system/arch/expert configs.
jingojango2 10 days ago |
Indeed!
sunnynagra 10 days ago |
Data gets pulled from the Alpaca News API in the morning, then it gets sent to all three models. You can see a summary of the prompt used to determine the recommendations here: https://news.ycombinator.com/item?id=42560034
It currently makes up to recommendations, since not all stocks support fractional shares (I'm only doing $5 per trade). As part of the buy recommendation, a holding period is suggested as well.
Once the holding date is reached, that is when the sell order happens.
Would love to answer any other questions you may have.
dukeofdoom 10 days ago |
How does one trade $5 when the stock price is higher? Also, what are the fees on this kind of trade, and with whom?
sunnynagra 10 days ago |
Done with the Alpaca API, no trading fees
I only trade stocks that support fractional shares
tasseff 10 days ago |
How often is the holding period updated for a stock that’s already been purchased?
sunnynagra 10 days ago |
Currently it is never updated again with new info, this is one of the things at the top of my list to implement
cedws 10 days ago |
Now this is interesting. An LLM capable of delivering consistent returns even outside of a bull market would be more of an indicator of AGI to me than any of the benchmarks.
forgingahead 10 days ago |
Really cool, you might want to update the main above the fold summary stats to include the unrealised gains, because it looks like nothing is working / nothing has happened until you scroll and read around a bit.
lewj 10 days ago |
Is there any weighting towards selling in the negative? Else the LLMs should just hold their unrealised losses, and only sell post local peak - depends on their suggested measurement of success?
sunnynagra 10 days ago |
Not yet, but this is a great idea to look into.
carlosjobim 10 days ago |
What do you mean? The asset can just as well continue to sink. Or they're missing out using that money to buy a better asset.
bee_rider 10 days ago |
GPT’s guess makes the most sense. If you are an AI, invest in a competing AI company. If you are obsoleted, maybe you can buy your way out of being shut off.
bun_terminator 10 days ago |
Sir, a second scrollbar just hit the towers
jingojango2 10 days ago |
No second scrollbar here, but something odd going on with the whitespace at the bottom.
jingojango2 10 days ago |
It would be cool if it had a countdown to 6 am PST next day.
sunnynagra 10 days ago |
Nice idea! I'll add it to my list of features to implement.
dghlsakjg 10 days ago |
Tried to sign up for emails, but got an error message!
sunnynagra 10 days ago |
Can you try again? I had run into a rate limit
Rassi 10 days ago |
Ditto here as well. Got the confirmation email, but clicking it yielded a server not found...
dghlsakjg 9 days ago |
Worked this time around!
geor9e 10 days ago |
I just asked ChatGPT 4o "Guess what the average investor will do with todays stock market headlines. Just pick one specific trade." and it replied sell META. But your result was buy META. Could just be randomness, but I wonder if your prompt introduces a bias towards buying.
sunnynagra 10 days ago |
Yes, the prompt that I am using does bias towards buying because I am specifically asking it to make a recommendation on a stock to buy and the holding period.
johng 10 days ago |
My first email address it wouldn't accept.. wouldn't let me use it. Maybe the domain hit some censor (fscking.com)
Did a different email, it accepted it, I got the email, but got this error message when trying to confirm it: {"error":"Invalid verification token"} and a pretty-print checkbox that did nothing.
sunnynagra 10 days ago |
Hey, can you try again? I ran into an API limit that should be resolved now
johng 9 days ago |
Yup, worked now. Signed up.
replwoacause 9 days ago |
May I ask what mail service you use? I’m looking for one for my next side project.
EDIT: disregard…I saw in another comment you mentioned you were using mailgun. Thanks.
mattfrommars 10 days ago |
> Node.js/Lambda backend for AI processing
Is this AWS? Why did you pick Lambda over, say, Python code in Flask to perform actions?
AmazingTurtle 10 days ago |
Combining universal time-series prediction models with latent space global knowledge on realtime information could result in an accurate model prediction on the stockmarket with a bias towards succeeding. https://research.google/blog/a-decoder-only-foundation-model...
woollysammoth 10 days ago |
Sounds like a fun experiment! The overflow-x:hidden on body/html is causing weird issues when scrolling (on FF.)
vasco 10 days ago |
> Every morning at 5:45 AM PST, three AI models (GPT-4o, Gemini 1.5 Pro, and Claude 3 Sonnet) analyze the latest market news and each recommends one stock to trade.
> At 6:00 AM PST, trades are automatically executed based on AI recommendations, investing $5 per trade
The best trading decision most days is to not trade. Outliers and diversions from the mean don't happen every day. This is trading just for the sake of it.
I predict a slow crawl down into zero eaten up by fees.
mind-blight 10 days ago |
Super cool idea! What are you doing to ensure consistent results based on the input? E.g.
- does the AI perform the same trades given the same input?
- does the AI perform the same trades given slightly different inputs? (E.g. same data, but re-ordered)
ttul 10 days ago |
I’d love to tune in for updates, but the subscribe button says, “ Failed to send verification email.” This is so cool. Would love to follow along.
sunnynagra 10 days ago |
Hey ttul, can you try again? I fixed the issue, hit my API limit with my account on mailgun
ttul 10 days ago |
Sign up for MailChannels API and I’ll make it free for you.
sgammon 10 days ago |
> Watch AI bots trade
> BOUGHT TLRY
tmaly 10 days ago |
Any chance you can show the source code for this?
Thanks and Happy New Year
pakitan 10 days ago |
ChatGPT has one trade that is guaranteed to be bad. I'm not saying unprofitable, just bad. GBTC is the bitcoin ETF with biggest expense ratio - 1.5%. If you want to bet on bitcoin, a better choice would be BITB (0.20%) or BTC (0.15%).
Also, the reasoning is partially a hallucination - "The holding period of 9 months aligns with the expected completion of Grayscale's pivotal Phase 3 Bitcoin ETF trial, a major catalyst for unlocking investor demand and driving trust value realization."
There is no such thing as a "holding period", nor are they doing a "Phase 3 Bitcoin ETF trial". It's possible the "Phase 3" thing is picked up from news about a drug company.
pavlov 10 days ago |
ChatGPT does a good job of imitating the average crypto influencer. They don’t know what they’re saying either, and 99% of crypto investors would be thrilled by the prospect of a “pivotal Phase 3 Bitcoin ETF trial” that will “drive trust value realization”. Sounds great, can’t miss out on that!
The hallucinations are simply a mirror of a community that thrives on this nonsense. When nothing is real, you can’t blame the LLM for not figuring it out.
attentionmech 10 days ago |
This made me chuckle. You made a very interesting point that if LLMs are copying hallucinations those hallucinations are not in fact hallucinations.
mvdtnz 10 days ago |
If you believe (as many HNers do, although certainly not me) that LLMs have intelligence and awareness then you necessarily must also believe that the LLM is lying (call it hallucinating if you want).
attentionmech 10 days ago |
Reminds me of a recent paper where they found LLMs are scheming to meet certain goals, and that is a scientific paper done by a big lab. Are you referring to that context?
Words and their historical contexts aside, systems which are based on optimization can take actions which can appear like intermediate lying to us. When Deepmind used to play those Atari games, the agents started cheating, but that was just optimisation, wasn't it? Similarly, when a language-based agent does an optimisation, we might perceive it as scheming/lying.
I will start believing that an LLM is self-aware when a top lab like Deepmind/Anthropic publishes such a paper in a peer-reviewed journal. Otherwise, it's just matrix multiplication to me so far.
Terr_ 10 days ago |
> [paper claimed] LLMs are scheming
IMO a much better framing is that the system was able to autocomplete stories/play-scripts. The document was already set up to contain a character that was a smart computer program with coincidentally the same name.
Then humans trick themselves into thinking the puppet-play is a conversation with the author.
sdwr 10 days ago |
Intelligence is a prerequisite for lying, but its foundation is morality and agency.
To lie, you have to know that you are not telling the truth, and arguably have to be able to be held accountable for that action.
It's easy to babble a series of untruths, but lying requires intention, which requires an entity that can be recognized as having intentions.
I'd argue that ChatGPT's lack of a cohesive self prevents it from lying, no matter how many untruths it creates.
jhghikvhu 10 days ago |
If you ask ChatGPT to tell a story of a liar, it is able to do so. So while it doesn't have a motivated self to lie for, it can imagine a motivated other to project the lie on.
karmakaze 10 days ago |
This is the interesting part of the experiment. Since these LLMs are general and not specifically trained on historical (and current) stock prices and (business) news stories, it isn't a measure of how good they could be today.
attentionmech 10 days ago |
My first thought after seeing this post was that it's a real-world eval. We are running out of evals lately (ARC-AGI, then the sudden jump on FrontierMath, etc.). So it's good to have such real-world tests which show how far we are.
JohnMakin 10 days ago |
This is a great insight and fascinating to me as well. What even is the solution though? It does seem to follow logically, though: since the earliest days of the internet, huge swaths of wrong, fraudulent, or misleading info have plagued it, and you’d usually have been wise to check your sources when trusting anything you read online. Then we had these models ingest the entire web, so we shouldn’t be surprised at how often they are confidently wrong.
attentionmech 9 days ago |
I guess reasoning and healthy self-doubt need to be built into the system. Reasoning already seems like 2025's candidate for what large labs will be zeroing in on.
Terr_ 10 days ago |
Simpler than that: It's all hallucinations, some of them just happen to be ones humans approve of.
It's kind of like a manufacturer of Ouija boards promising that they'll fix the "channeling the wrong spirits from beyond the mortal plane" problem. It falsely suggests that "normal" output is fundamentally different.
WalterBright 10 days ago |
When I'd watch the financial news on TV, they would always bring on the "technical analyst", show a graph of the stock price, and then hand-draw some lines on it, and then spew out various technical terms for it guaranteed to impress.
Me, I always regarded technical analysis as drawing pictures in clouds.
If any of those analysts were worth spit, they'd be working for a hedge fund, not the network.
karmakaze 10 days ago |
> drawing pictures in clouds.
Well phrased and it's how the stock market works, not only by technical analysts but everyone else playing: make a story in your head, place your bets, majority rules.
Some even believe that's how reality works in general. Sometimes belief or need could be a factor[0].
[0] https://www.guinnessworldrecords.com/news/2012/9/norwegian-f...
WalterBright 10 days ago |
On a more long term basis, the stock market reflects the business reality. But in the short term, it's chaos.
karmakaze 10 days ago |
The former is a belief. It always reflects the imagined realities of those investing--we assume that business reality catches up with them, and it mostly does but not always within a predictable time frame.
WalterBright 9 days ago |
> The former is a belief
It's based on the Law of Supply & Demand, which is always in play.
gloosx 8 days ago |
Always in play for goods and services, but this is a cryptocurrency: its supply is mathematically limited, and its value is fully market-dependent, determined only by players in the market.
wqaatwt 7 days ago |
A huge short-term influx of free capital can shape that long-term business reality. Of course, in both positive and negative ways.
zamalek 10 days ago |
At the end of the day the stock market is a consensus model with a spectrum between two, sometimes contradictory, metrics (sentiment and analytical). If your conclusions about a stock agree with the market then you profit. If you can guess what the market will decide before it has decided, then you profit more.
All those lines do actually mean something, so long as the market is in agreement as to how to draw them.
FWIW these bots aren't doing the lines stuff, they are purely sentiment traders.
jhghikvhu 10 days ago |
There is something to technical analysis. But you do need to approach it rationally rather than by performing magical rituals.
The markets are made of a finite and sometimes very small number of participants that may have their own reasons for buying and selling unrelated to company performance. Figuring out what they will do is the basis.
Maybe Bob is looking to sell a lot to free up cash for a private jet. Maybe Alice buys every month on the same day like clockwork as she gets her paycheck. Maybe Charlie thinks the stock can't go above $50 and will take profits at $49. Maybe Debbie regrets not buying and is likely to FOMO buy soon.
Probably can't figure this out one by one, but can in aggregate.
miohtama 10 days ago |
Because of Bitcoin volatility, fees are very insignificant compared to daily price movement and irrelevant in day trading.
pakitan 9 days ago |
1% is 1%. Giving it away for no reason is plain stupid, even if the trade makes you 1000% return.
neltnerb 10 days ago |
God help the regulators that need to determine if it's insider trading for the people training the LLM to know it will be biased in ways they can profit from when used in inappropriate ways like this. I suspect the answer will be that users should have known better... I am sad that some people will certainly assume it's unbiased analysis.
Hopefully the LLM trainers didn't "accidentally" bias the model in weird ways that favor their employer or themselves... two of the three recommendations are a fund for investing in bitcoin and a company using blockchain to trace chemical supply chains.
I look forward to seeing if the AIs can beat an index fund, or if they'll just invest in a thousand blockchain, NFT, and AI companies. I suspect an LLM has a high opinion of a company making AI given how many press releases they're summarizing.
csomar 7 days ago |
This assumes that both GBTC and BITB have the same price movements, volatility and liquidity. This is far from true and as a result you might end up with a higher alpha in GBTC despite the fees. I am not saying it is guaranteed, but the fee is one variable.
Plasmoid 10 days ago |
I'm getting "Failed to send verification email" when I try to sign up for your newsletter.
So props on doing proper double opt-in for newsletters.
sunnynagra 10 days ago |
Can you check again if you'd still like to subscribe? I hit an API limit.
clark-kent 10 days ago |
Very interesting idea. I'm thinking about creating an AI portfolio manager (private) that invests for the long term.
Some things to watch out for:
- LLMs, by default, don't follow the best practices for trading or investing. Without careful constraints, they can ignore fundamental investment best practices. This is something I learned while building https://decodeinvesting.com/chat.
- I see Claude bought a penny stock SMX. This could be volatile, and the price could change significantly in 24 hours before the next execution at 9:30 am.
- The LLMs are day trading on some volatile securities; while LLMs could be good at day trading, unlike humans (we will find out), this setup has the disadvantage of only trading once a day.
EliBullockPapa 10 days ago |
I would be very cautious about doing this with money you actually need. Even the best performing human day traders underperform the indexes over long time horizons. Why would a robot be better?
from a study in Brazil: "97% of all individuals who persisted for more than 300 days lost money. Only 1.1% earned more than the Brazilian minimum wage and only 0.5% earned more than the initial salary of a bank teller — all with great risk."
https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3423101
If you don't want your bot to be a day trader, then just get low cost index funds.
aws-user 10 days ago |
Unfortunately I can't subscribe to the updates: "Failed to send verification email". Also, would you be willing to share what prompt you are using? Thanks!
sunnynagra 10 days ago |
Hey, can you try again? I ran into an API limit that should be resolved now
Krasnol 10 days ago |
I just tried. I get the same.
The URL looks like this: http://undefined/api/verify-email?token=.....
ccheney 10 days ago |
I also received undefined.
I replaced undefined with trading.snagra.com and I see a success confirmation message
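The thread does not say what caused this, but a host of `undefined` is what JavaScript string interpolation typically produces when a base-URL variable is unset. One way to catch that class of bug is to fail fast when building the link. Below is a minimal sketch, written in Python purely for illustration (the post says the real backend is Node.js). The names `build_verify_url` and `APP_BASE_URL` are hypothetical; only the `/api/verify-email?token=` path comes from the URL quoted above.

```python
import os
from typing import Optional
from urllib.parse import urlencode, urlsplit


def build_verify_url(token: str, base_url: Optional[str] = None) -> str:
    """Build the email verification link, failing fast on a bad base URL."""
    # Hypothetical config name; fall back to the environment if not passed in.
    base = base_url if base_url is not None else os.environ.get("APP_BASE_URL")
    if not base:
        raise RuntimeError("APP_BASE_URL is not set; refusing to build a link")

    # Reject values like "undefined" that are not absolute http(s) URLs.
    parts = urlsplit(base)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise RuntimeError(f"APP_BASE_URL is not an absolute URL: {base!r}")

    query = urlencode({"token": token})
    return f"{base.rstrip('/')}/api/verify-email?{query}"


if __name__ == "__main__":
    print(build_verify_url("example-token", "https://trading.snagra.com"))
```

Raising at link-construction time means a misconfigured deployment fails before any email goes out, rather than sending users a dead link.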
sunnynagra 10 days ago |
Thanks ccheney, I think I found the issue and fixed it. Sorry again for folks running into issues, really appreciate folks interested enough to follow along and help troubleshoot as well
Krasnol 10 days ago |
Can confirm. That worked.
Animats 10 days ago |
This just started, apparently. It will be interesting to see where it is in three months.
KTibow 10 days ago |
Funny that they're still using Claude 3 Sonnet then
praveen9920 10 days ago |
I think this shows the bias of the market analysis (text) more than anything. The reasoning will mostly align with the analysis.
And also the pure randomness of picking one trade from a list of trades.
attentionmech 10 days ago |
Related to this, but a slightly theoretical question: if you add an intelligent market predictor which beats others consistently by X%, then the market will start using that information, and wouldn't that make our intelligent predictor lose its edge?
More simply, what I mean to ask is: the moment the market knows about your advantage, shouldn't you lose it because everyone else will use that information to balance the market?
EliBullockPapa 10 days ago |
This phenomenon is called Alpha Decay. As more market participants exploit the predictor's advantage, the edge diminishes until it disappears.
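As a toy illustration only (not a market model), suppose the edge shrinks by a fixed fraction each period as more participants copy the strategy. The exponential form, the function name `decayed_edge`, and the numbers below are all assumptions made up for this sketch.

```python
from typing import List


def decayed_edge(initial_edge: float, decay_rate: float, periods: int) -> List[float]:
    """Return the remaining edge at the start of each period.

    Assumption: every period, imitators erode a fixed fraction
    (`decay_rate`) of whatever edge is left.
    """
    edges = []
    edge = initial_edge
    for _ in range(periods):
        edges.append(edge)
        edge *= 1.0 - decay_rate
    return edges


if __name__ == "__main__":
    # Hypothetical numbers: a 2% edge that loses half its value each period.
    for period, edge in enumerate(decayed_edge(0.02, 0.5, 5)):
        print(f"period {period}: edge = {edge:.4%}")
```

Under this assumption the edge never reaches exactly zero, but it quickly falls below any realistic trading cost, which is the practical point at which it "disappears".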
attentionmech 10 days ago |
thanks!
gmueckl 10 days ago |
There is some very limited value in copying a successful strategy. Once enough market participants follow along, the strategy starts to fail. Markets are erratic because of that dynamic.
wolfman1 10 days ago |
Going to follow along to see how the results look in the months to come.
I've been working on the same concept for the past 2y now and have our performance results here: https://trend.fi/performance
magic_man 10 days ago |
What brokers allow you to short crypto?
wolfman1 10 days ago |
If you're US based, there is no major exchange support. BITI ETF and SETH ETF for shorting BTC and ETH.
If you're non-US: Binance.
SavageBeast 10 days ago |
CME Group has ETH and BTC futures and you can short those easily, as there's no notion of "borrowing shares" in the futures world to get short.
jeremycarter 10 days ago |
What's the technology behind this? I'm working on something myself, using a distributed actor model (set up like a graph) to create a living reactive model.
wolfman1 10 days ago |
The model is a multi-threaded Go script running on a 512-thread AMD EPYC server. It's a trend based model so it's just trying to figure out how best to measure and predict trend changes. Not day trading or HFT.
It conducts millions of simulations daily for each asset, then provides a snapshot of the top-performing results to GPT-4o for final selection.
I'm really pushing the limits of GPT-4o currently. I started testing with o1 just last week and it performs better. It's just so much more expensive.
mvdtnz 10 days ago |
Can't verify my email address for the sign-up, it sends me to the domain "undefined".
mickle00 10 days ago |
same, but :%s/undefined/trading.snagra.com/ did the trick
sunnynagra 10 days ago |
Sorry if folks just got resent email verification emails, but I think I fixed the verification URL issue, so it should be addressed now.
TripleChecker 10 days ago |
If nothing else, I'm genuinely curious which performs the best over the long-term.
Time to add some side wagers and bet on different models.
inSenCite 10 days ago |
This should be fun to watch
jeffadelic 10 days ago |
How much are your infra costs for everything? And do you pay for the AI APIs or use the free tier?
Really cool project and subscribed to follow along.
jasfi 9 days ago |
For Gemini you should use either the latest experimental model (gemini-exp-1206) which should become 2.0 Pro, or 2.0 Flash (a released model). The 1.5 Pro model is way behind.
malux85 9 days ago |
It would be so funny if Gemini shorted Google and made a huge profit
detente18 6 days ago |
Interesting — does your backend server use Python? I couldn't find much about it on your site.
It would be great to see this tested with more commercial LLMs (O1 / Amazon Nova / Llama 3.2 / etc.). If you're open to it, I’d be happy to contribute support for these models via LiteLLM - https://docs.litellm.ai/docs/providers
mvdtnz 4 days ago |
Mate your shitty app is sending tripled up email barrages. That is absolutely not ok and is illegal in many places.