Apple's macroscalar architecture: what it is & what it means

By | July 8, 2011, 9:43am PDT

Summary: What the heck is Apple’s macroscalar processor? And why should we care?

What in heck is Apple’s macroscalar processor? And why should we care?

Apple got a new patent on something called a macroscalar processor this month. You may have seen a write-up on it that quoted the application and stopped there. Because no one knows what it is and why you would want one. But I know.

Macroscalar
Macroscalar is NOT a term of art in computer science - although a quick Bing search will find that it is occasionally used in other disciplines by academics eager not to be understood. Let’s break it down.

Macro means - in a word of 1 syllable - big. Macroeconomics explains the big gears of the economy. Macro lenses make little things big by getting close to them. Office macros make big, complex actions into 1 command.

Scalar means a single value, as opposed to a vector. A vector processor takes a vector of values, say

10 14 666 22 84 53 45 25 14 47 91

and with 1 instruction performs the same operation on all of them, such as multiplying by a single scalar value or another vector.

When you have masses of data to massage - such as in imaging - vector processors speed things up. Which is why the companies that built vector processors for a living got killed by Intel and GPUs.

Scalar OTOH, means a single value. So what the heck is a BIG single value processor?

Parallel processing loops
As the Background of the Invention section in the patent explains, rising clock frequencies aren’t translating into higher performance because most consumer software isn’t optimized to use longer pipelines.

While consumers purchase newer and faster processors with deeper pipelines, the vast majority of software available is still targeted for processors with shorter pipelines. As a result of this the consumer may not realize the full processing potential of a new processor for one to two years after its release, and only after making additional investments to obtain updated software. . . . (I)t is questionable how efficiently deeper pipelines will actually be utilized.

This is important because as CPUs hit Moore’s Wall, more parallelism is the chief way to get more performance - and deeper pipelines are a chief way to achieve instruction-level parallelism.

What is a pipeline? Skip this if you already know.
As defined in Wikipedia an instruction pipeline is a technique used in the design of computers and other digital electronic devices to increase their instruction throughput (the number of instructions that can be executed in a unit of time).

Here’s a graphic example from Wikipedia:

Since a good portion of a typical program’s time is spent in loops, optimizing loops for parallel execution is a popular form of speed up. Loop unrolling and auto-vectorization are 2 common techniques, but they don’t work with data-dependent loops whose length depends on their results rather than a fixed number of iterations.

The invention
The macroscalar processor addresses this problem in a new way: at compile-time it generates contingent secondary instructions so when a data-dependent loop completes the next set of instructions are ready to execute. In effect, it loads another pipeline for, say, completing a loop, so the pipeline remains full whether the loop continues or completes. It can also load a set of sequential instructions that run within or between loops, speeding execution as well.

The macro piece is the large number of program registers required so that all or most possible instruction paths are already loaded into CPU registers for fast execution. Instead of maintaining 1 pipeline, the architecture effectively maintains parallel pipelines, loading them and then switching between them to maximize loop performance.

Much of the patent is focused on compiler coding needed to take advantage of the macroscalar architecture. Essentially the compiler needs to analyze the work flow and understand where run-time decisions will be made so the processor can know when it needs an alternate pipeline.

The Storage Bits take
Apple is, among other things, a fabless semiconductor company. They design, but do not build, their own processors, such as the dual-core A5 in the iPad 2.

The macroscalar architecture, well implemented, offers higher performance for a given clock-speed and lower energy consumption. Both are valuable in mobile devices.

Since Apple provides its own compilers as well as designing CPUs, it is uniquely positioned to offer a complete macroscalar solution to its large band of iOS developers, further widening the price/performance gap between it and the iPad wannabes.

Is it a breakthrough? It could be if the efficiencies it promises can be realized in practice. We’ll have to see just how good Apple’s compiler engineers are.

Comments welcome, of course. Macroscalar is another way to throw transistors at the problem of speeding up computers. I like it.

Kick off your day with ZDNet's daily e-mail newsletter. It's the freshest tech news and opinion, served hot. Get it.

Topics

Robin Harris has been messing with computers for over 30 years and selling and marketing data storage for over 20 in companies large and small.

Disclosure

Robin Harris

Robin Harris is a president of TechnoQWAN, a consulting and analyst firm in northern Arizona. He also writes StorageMojo.com, a blog which accepts advertising from companies in the storage industry, and has a 25 year history with IT vendors. He has many industry contacts, many of whom are friends and all of whom he has opinions about. Robin has relationships with many companies in the technology industry. Every company he writes about may have sought to influence his opinion through carefully-crafted marketing messages and self-serving white papers, gifts ranging from desk calendars, t-shirts, lunches and trips as well as analyst or consulting assignments. He also invests in some technology companies. He may accept payment for services in stock as well. Robin discloses financial investments in or client relationships with companies named in Storage Bits. To help readers sort out the gold from the dross in his writings, Robin tries to communicate his reasons as clearly as he can. If you agree, you are intelligent and discerning. If you disagree, well, you disagree. In all cases, Robin encourages readers to subject everything they read, see or hear on the internet or from politicians to some simple questions: * What assumptions are implicit in the world view and judgments of the author? * What, if any, is the factual basis for the opinions the author expresses? * Is it reasonable, logical and clear? Your critical faculties: use ‘em or lose ‘em!

Biography

Robin Harris

Harris has been messing with computers for over 30 years and selling and marketing data storage for over 20 in companies large and small. He introduced a couple of multi-billion dollar storage products (DLT, the first Fibre Channel array) to market, as well as a many smaller ones. Earlier he spent 10 years marketing servers and networks. After leaving corporate life he founded TechnoQWAN, a consulting and analyst firm. He also developed StorageMojo into one of the top storage industry blogs.

Robin writes, consults, coaches and lives among the mountains of northern Arizona.

20
Comments

Join the conversation!

Just In

RE: Apple's macroscalar architecture: what it is & what it means
FAULKNE 13th Oct
Good day to confirm this comment I would appreciate T h e b e s t o f Z D N e t d e l i v e r e d your website very nice to everyone Yes, Oracle is the only one with shared-disk architecture, but that is there advantage. It means you can add or remove nodes and the database lives on. In a shared nothing architecture, if you lose a node, you lose the system. I'm sure Oracle appreciates EMC highlighting their advantage.I also desire to signal in your RSS feeds. Thank you as soon as once again and maintain up the great operate Awesome post! Thank you very much || thanks for nice content this is really benefit to me.
... this technology is either already implemented in Apple's A5/XCode SDK (which includes regimes for iPad/iPhone/iPod touch), or it will not be done in the near future at all since it might be really good idea theoretically, but not really good in practice because of some not that obvious nuances that break the beauty of the approach.

(It is utterly rare that Apple's patents information precedes soon to be implemented technology. Basically, I am yet to see that variant.)
0 Votes
+ -
How fun!
ego.sum.stig@... 8th Jul
You seem to be the target of "Mr/Ms Flag Them"
Given your description, it's hard to believe this wasn't thought of and implemented around 1994.
@rynning It may have been, but the economics favored more profitable uses of transistors, such as branch prediction. This is feasible now because we have billion transistor chips.
0 Votes
+ -
Thank you for writing on tech
ego.sum.stig@... 9th Jul
A breath of fresh air. Other zdnet bloggers should take note, and flag away "writing" the "omg, read my pap, it's important for page hits" froth and bubble that is way too common on zdnet these days.
  • Flagged
0 Votes
+ -
Question
ego.sum.stig@... 10th Jul
So, how could you take advantage of this as a coder/designer? Do you have anything on that?
0 Votes
+ -
Article title is more than just a little
thx-1138_@... Updated - 16th Jul
@Robin Harris .. misleading.

"...Apple got a new patent on something called a macroscalar processor this month."

Is the blog about patents or Storage Bits / Media? I mean, that's the subject matter you chose to open with.

So, your first mention is of a 'patent' .. not an actual technology. What's the significance of a patent to 'The Storage Bits take'?? While you're at it, can you eloborate on how 'patents' relate to CPU clock timings and their subsequent affect on I/O scheduling (for example).

"...Apple is, among other things, a fabless semiconductor company. They design, but do not build, their own processors,"

Which begs the obvious question: since when did Apple magically become "great designers of processors"??

Then you throw in this completely contradictory doozey ..

"...Since Apple provides its own ... as well as designing CPUs..."

Are we now to believe that Apple have been designing CPU's and that Intel (and now AMD) actually haven't been providing processors to Apple?

I'm having difficulty finding much media on Apple supplanting Intel and AMD in the processor space .. is there some insider info' you're holding out on us with?

Now look, the macro-scalar concepts involved sound great ... i simply don't get why you felt the need to even mention the word 'patent' in a blog that supposedly revolves around Storage Media and related processing.

That aside, there are inconsistencies and misleading material in your write-up that just plain don't add up.

So .. please, do elaborate.
Thanks for sharing the knowledge. It was a pleasure to read and learn from your blog
0 Votes
+ -
When real articles were written about technology rather than just reviews. Good stuff.
0 Votes
+ -
Updated Information Links:
jogn_smith Updated - 8th Jul
A *much* better explanation can be found in Apple's patent filings.
http://www.google.com/search?q=20100042815&btnG=Search+Patents&tbm=pts
@jogn_smith

Thanks for this. I tried hard to understand above but i really am having a hard time understanding it. lol. pacquiao vs marquez tickets
I also desire to signal in your RSS feeds. Thank you as soon as once again and maintain up the great operate! nccma cooler
I used to be more than happy to seek out this internet-site.I wanted to thanks in your time for this glorious read!! I positively enjoying each little bit of it and I have you bookmarked to check out new stuff you weblog post. this thread is amazing i like your work and i appreciate you that you have share a useful stuff thanks for sharing the i shop abatwa
I used to be more than happy to seek out this internet-site.I wanted to thanks in your time for this glorious read!! I positively enjoying each little bit of it and I have you bookmarked to check out new stuff you weblog post.Bookmarking now thanks please consider a follow up post. power sa shop
I think the representation of this article is actually superb one. This is my first visit to your site. Thanks a lot and keep sharing the information. Keep updating the information for all of us. Thanks ZDNet Government was launched as the brand's first industry vertical, with a mission to cater to IT professionals in the public secto I agree with your post. However, do you have any sources I can cite for my paper wheel car com bury
Well welcome, hopefully you can become a vital member of the community and really help to push far ahead of google. Which Im sure the development team would love. This will of course earn you alot points too and get you on the leaders board. z d n e t t h a n k Im not sure i come to an agreement with you on every level, howevor it absolutely was a good posting, many thanks for taking the time to put up your ideas.
Thanks nice info z d n e t I really liked your current article write more..let me add you to its favorite
I really enjoyed reading this post !!!have bookmarked w e b s will come back to read more.
Fantastic news about the new release.I positively enjoying each little bit of it and I have you b o o k m a r k e d to check out new stuff you weblog post.Im not sure i come to an agreement with you on every level, howevor it absolutely was a good posting, many thanks for taking the time to put up your ideas
Good day to confirm this comment I would appreciate T h e b e s t o f Z D N e t d e l i v e r e d your website very nice to everyone Yes, Oracle is the only one with shared-disk architecture, but that is there advantage. It means you can add or remove nodes and the database lives on. In a shared nothing architecture, if you lose a node, you lose the system. I'm sure Oracle appreciates EMC highlighting their advantage.I also desire to signal in your RSS feeds. Thank you as soon as once again and maintain up the great operate Awesome post! Thank you very much || thanks for nice content this is really benefit to me.

Join the conversation!

Formatting +
BB Codes - Note: HTML is not supported in forums
  • [b] Bold [/b]
  • [i] Italic [/i]
  • [u] Underline [/u]
  • [s] Strikethrough [/s]
  • [q] "Quote" [/q]
  • [ol][*] 1. Ordered List [/ol]
  • [ul][*] · Unordered List [/ul]
  • [pre] Preformat [/pre]
  • [quote] "Blockquote" [/quote]
ie8 fix

The best of ZDNet, delivered

ZDNet Newsletters

Get the best of ZDNet delivered straight to your inbox

Facebook Activity

White Papers, Webcasts, & Resources
ie8 fix