• 0 Posts
  • 185 Comments
Joined 3 years ago
Cake day: July 5th, 2023


  • Sharing trade secrets under the terms of a contract that dictates how one can use the information still retains trade secret protections.

    Without a contract: intentional disclosure to the person who receives it generally destroys the trade secret status of the information, because the “owner” of the information didn’t do a good job trying to protect it.

    With a contract: intentional disclosure to a person under the terms of the contract makes the contract’s own protections of the information relevant, and misuse of the information by the recipient can get them sued under the contract. Plus, the information itself probably retains trade secret protection so that even if that person gives the information to a third party who can’t be sued under a contract they never agreed to, there are still rights to protect that trade secret as property.

    I’d be shocked if any paid API use isn’t under a robust, enforceable contract. The only question is whether the contract language itself effectively prohibits distillation.



  • The actual process of creating semiconductors is basically:

    1. Etch a stencil that has the pattern you want.
    2. Place the stencil over a piece of silicon.
    3. Bombard the silicon and stencil with radiation so that the chemical properties of the silicon change exactly under that stencil.
    4. Repeat the process with multiple other stencils, so that the resulting silicon has basically shapes of wires and logic gates that can perform different functions with the electricity running through those shapes.

    In recent years, step 3 has gotten so complicated, based on needing to create radiation of exactly a particular wavelength of extreme ultraviolet light focused exactly on the silicon (and the mask/stencil above it), because that wavelength allows for the smallest possible features on the silicon. So they take purified tin, melt the tin into molten liquid, and ejecting the molten tin in a liquid jet downward into a vacuum at exactly the right speed to where it forms into droplets of the exact size for the machine (about 50 μm), then blasts each droplet, mid-fall, with a 1.6kW laser that heats it up so hot that it vaporizes and ionizes into plasma at the exact position where a system of highly polished and precisely positioned mirrors focuses the UV radiation evenly onto the silicon surface.

    Oh, and the machine makes one tin droplet every 1/50,000 of a second, so in any given second it ionizes 50,000 droplets in the stream.

    The machine costs something like $300 million, and requires full time experts to make sure that it’s working correctly.

    Everything else in the fabrication facility is similarly complicated, which is why a fab represents something like $30 billion in total costs over its lifetime.


  • The Arch Wiki describes the AUR in plain terms: it’s a user-submitted community repository of software, not warranted to be safe or even vetted by Arch maintainers, packaged to be friendly with pacman.

    If you’re doing things the “Arch way” the differences between the AUR and officially supported packages should be obvious, and you should at the very least skim the PKGBUILD files to understand where things are coming from and how they work.





  • It’s called decoding and encoding.

    But the big data centers doing all the video processing for the big video services (including both permanent videos from a library and things like live streaming) are encoding the videos with settings that require less computational power to decode. The idea is to be able to let even old budget smartphones still be able to display the video with very low power requirements on the client device. There’s no universe where consumers decoding digital video will be a high-power computational task.

    Restaurants have sharp knives in the kitchen, but generally serve food that requires only minimal cutting effort from the table knives set out with the rest of the table settings. Dining will always be easier than cooking, by a margin that makes the difficulty of dining not worth mentioning, so it would be bizarre to criticize a knife as being only good for cooking and eating food, when plenty of dining tableware knives out there would be insufficient for kitchen work.

    You’ve made the mistake of lumping decoding and encoding together based on the algorithmic/mathematical similarity of those tasks, when everyone else is more inclined to discuss the very different end user use cases of those computing needs.


  • I don’t think government funding can actually offset the crash in consumer and business demand being insufficient to cover the cost of the most expensive models on the most expensive GPUs. But if you look through my comment history I’ve made the comparison to supersonic flight, because I genuinely believe there’s a possibility that governments fund the expensive branch of this technology for their own military or surveillance or law enforcement purposes without the benefits necessarily actually spilling out into normal commercial applications.

    We’ve hit the point where training a model (both pre training and post training) isn’t the expensive part, and the expensive part is actual inference, which makes it hard to scale the most expensive models to where it’s useful for a lot of people. So it might be that the companies and governments that can afford to operate an expensive model might be the only ones to do it. And they’ll be able to, without necessarily the public being able to have access to the same tech.




  • There’s just no way to pay for the cost of these services, though.

    When someone constructs a 100 MW data center (now considered a smaller one for new construction), that’s about $2 billion in total costs to outfit the whole operation. And then once it’s on, we’re talking something like $10-20 million/month in electricity alone, and a few million in other costs. How many $20 subscriptions do you need to sell just to break even with your operating expenses? How many $100/month subscriptions do you need to sell to make a dent on your interest payments on the construction? Will there be a market for $1000/month subscriptions from millions of customers? If not, how’s this all going to be paid for?








  • The only solution is to make sure they can’t read data you don’t want shared.

    Isn’t that the appropriate guardrail, then? LLM chats and agents and whatever need to be contained with external permissions settings that the LLMs simply do not and can never have the power to override.

    In a normal customer service setting with human agents, there are still plenty of examples of what a human agent simply doesn’t have the power to do. Often, they’ll need to escalate to a manager to do things like process refunds not just because they weren’t given social permission to do so, but because they weren’t given technical permissions to do so. LLM agents need to be contained in the same way. Any decent use of agents, human or software, requires carefully designed processes and permissions extrinsic to that agent’s own decisionmaking abilities to make sure that agents don’t do something bad for the company.