硬气是什么意思| 身上长扁平疣是什么原因造成的| 心梗是什么意思| 扁桃体有什么用| 硫酸羟氯喹片治什么病| 梦见前婆婆是什么意思| 相思病是什么意思| 羧甲基纤维素钠是什么| 补充镁有什么好处| 第二学士学位是什么意思| 勰读什么| 蒙古国什么时候独立的| 姐姐的女儿叫什么| 赵云字什么| 吃马齿苋有什么好处| 精子碎片率高吃什么药| 最近老是犯困想睡觉是什么原因| 心脏在乳房的什么位置| 效果图是什么意思| 淀粉是什么粉| 梦见和老公吵架是什么预兆| 老虎吃什么食物| 然五行属性是什么| 得偿所愿什么意思| 叶黄素是什么东西| 技压群雄的意思是什么| 吊是什么意思| 什么叫扁平疣| sars是什么意思| 筋是什么组织| 美版苹果和国行有什么区别| 易孕期是什么时候| 友女是什么意思| 梦见蛇蛋是什么意思啊| 吃黄瓜有什么好处| iqr是什么意思| 膑是什么意思| 车辙是什么意思| 剖腹产可以吃什么水果| cp什么意思网络用语| ph值低是什么原因| 阴道口痒用什么药好| 孕妇能喝什么茶| 獠牙是什么意思| au9999是什么意思| 水手是干什么的| fq交友是什么意思| 哺乳期胃疼可以吃什么药| 卵巢检查做什么项目| 大堤是什么意思| 什么是疱疹怎么得的| 血小板低有什么危险| 吃鸭蛋有什么好处和坏处| 吃什么指甲长得快| 扳机点是什么意思| 血清碱性磷酸酶高是什么意思| 吃什么能帮助睡眠| 生二胎应该注意什么| 内径是什么意思| 红苋菜不能和什么一起吃| aimer是什么意思| 甲状腺有什么功能| ntr是什么意思啊| 背部长痘痘是什么原因造成| 梦见着火是什么预兆| 白醋加盐洗脸有什么好处| 女人亏气亏血吃什么补的快| 晚上尿多是什么病| 嗓子痒痒老想咳嗽是什么原因| atp是什么| 老是头疼是什么原因| 山药有什么功效| cupid什么意思| instagram是什么意思| 刮脸有什么好处与坏处| 体香是什么味道| 复合维生素b片主治什么病| 血管炎是什么病| 01年的属什么| 芈月和秦始皇是什么关系| 痔疮用什么药治最好效果最快| anca医学上是什么意思| 淋巴细胞降低说明什么| 未见卵黄囊是什么意思| 婚检都检查什么| 口苦口干吃什么药| 女性长期缺维d会带来什么病| 地盆是一种什么病| 萎靡不振是什么意思| 舌头溃疡吃什么药最好| 高校新生是什么意思| 角是什么生肖| 什么动物睡觉不闭眼睛| 甲亢在中医里叫什么病| 地震为什么会发生| 梦见发大水是什么意思| 转氨酶高是什么意思| 心机血缺血吃什么药最好| 湿气重会有什么症状| 结膜炎是什么| 巴西货币叫什么| 锡字五行属什么| 胆红素偏高挂什么科| 火乐念什么| 驴血为什么是白色的| 陶渊明是什么派诗人| 男人额头凹陷预示什么| 脾虚是什么原因引起的| hdv是什么病毒| 幼儿急疹吃什么药| 哺乳期吃避孕药对孩子有什么影响| giada是什么牌子| 一拃是什么意思| bb粥指的是什么意思| 什么金属最贵| 自主神经功能紊乱吃什么药| 增强记忆力吃什么| 自行车什么牌子好| sv是什么意思| 斑秃去医院挂什么科| 打火机的气体是什么| 母亲节说什么| 支原体感染吃什么药| 正营级是什么军衔| 磨破皮了涂什么药| 什么的脑袋| act是什么意思| 脚后跟疼吃什么药| 撸管是什么| 减脂是什么意思| les是什么意思| 长痱子是什么原因| 红虾是什么虾| 红配什么颜色最好看| 电头是什么| 百雀羚属于什么档次| 地藏王菩萨保佑什么| 虫草花不能和什么一起吃| 胃难受是什么原因| 死库水是什么意思| 生化检查能查出什么病| 凌寒独自开的凌是什么意思| aj是什么鞋子| 10.8号是什么星座| 奀是什么意思| 唐筛临界风险是什么意思| 歌声什么| 为什么睡觉会打呼| 游园惊梦讲的是什么| 梦见发大水是什么预兆| 医学ac是什么意思| 高考三百多分能上什么学校| 澳门用什么币种| ggo是什么意思| 晚上睡觉经常醒是什么原因| emma什么意思| 吃什么不胖反而瘦| 真菌是什么原因引起的| 五十八岁属什么生肖| 引渡是什么意思| 尿细菌高是什么原因| 沙参长什么样子图片| 人咬人有什么危害| 鸡眼长什么样| 吃什么头发长得快| 性有什么好处和坏处| 燕窝什么时候吃好| px是什么| 头晕眼睛模糊是什么原因| 轧戏是什么意思| 夏天什么花开| 什么烧鸭子好吃| 湿气是什么原因引起的| 什么地说话| 青岛是什么省| 凯撒沙拉酱是什么口味| 女人左手麻要注意什么| 96年是什么年| 肾在五行中属什么| 条条框框是什么意思| 夜尿多吃什么药| 连衣裙配什么鞋子好看| 梦见梅花鹿是什么预兆| 肛门痛是什么原因| 夏至是什么意思| 头部出汗多是什么原因| 智五行属什么| 红枣有什么功效和作用| 山药为什么煮熟了也麻口| 公务员是做什么工作的| 为什么会牙疼| 壮字五行属什么| mgd是什么意思| 打嗝不停是什么原因| 瘿瘤是什么病| 逆袭什么意思| 榄仁叶是什么树的叶子| 自学成才是什么意思| 升结肠憩室是什么意思| 油菜花是什么季节开的| 唐宝是什么意思| 化疗为什么要剃光头| 何以笙箫默是什么意思| 舌头麻木是什么征兆| 腋毛癣用什么药膏| 净值是什么| r值是什么| 妈妈的奶奶叫什么| 儿童舌系带短挂什么科| 新生儿老是打嗝是什么原因| 穿匡威的都是什么人| 去湿气吃什么最好| 高丽参和红参有什么区别| 戒掉手淫有什么好处| 阳性血是什么意思| 嗓子哑是什么原因| 杀手锏是什么意思| 胃肠炎吃什么药好| 什么烟最好抽| 子午相冲是什么意思| 呼吸困难是什么原因| 掉头发缺少什么维生素| 畈是什么意思| 女人为什么会宫外怀孕| 月经不来是什么原因导致的| mpn是什么意思| 表头是什么| x58主板配什么cpu| 木瓜吃了有什么好处| 血沉高是什么原因| mr和mri有什么区别| eis是什么意思| 回南天是什么时候| 孩子结膜炎用什么眼药水| 彩超检查什么| 痧是什么| 头晕恶心呕吐是什么原因| 桃子不能和什么食物一起吃| 素肉是什么| 脑膜炎是什么病严重吗| 什么是情劫| 吃了什么药不能喝酒| 1976年是什么命| surprise是什么意思| 93岁属什么生肖| 董字五行属什么| 花干是什么做的| 肌酐高说明什么问题| 樱花什么时候开| 冒节子是什么东西| 博五行属性是什么| 心火大吃什么药| 猫在风水上代表什么| 什么的桃花| 甲状腺应该多吃什么| 你正在干什么用英语怎么说| 1.18是什么星座| 绝膑而亡是什么意思| cosplay是什么| 包皮开裂擦什么药膏| 胃疼可以吃什么食物| 40而不惑是什么意思| 报单什么意思| 什么是代沟| 百度Jump to content

国际珠宝网 - 珠宝品牌直达 Jewelry brand direct

From Wikipedia, the free encyclopedia
Coherent Accelerator Processor Interface
Year created2014; 11 years ago (2014)
Created by
百度 鍚夋灄澶у绀句細绉戝瀛︽姤缂栬緫閮/h1>EditorialDepartmentofJilinUniversityJournal,SocialSciencesEdition涔犺繎骞虫柊鏃朵唬涓浗鐗硅壊绀句細涓讳箟鎬濇兂鐮旂┒鍙嶈厫璐ヤ笓棰樼爺绌/h1>[162K][涓嬭浇娆℃暟锛FONTcolor=red>19]|[缃戝垔涓嬭浇娆℃暟锛FONTcolor=red>0]|[寮曠敤棰戞锛FONTcolor=red>0]|[闃呰娆℃暟锛FONTcolor=red>6]鐜娉曞緥鍒跺害鐮旂┒[136K][涓嬭浇娆℃暟锛FONTcolor=red>20]|[缃戝垔涓嬭浇娆℃暟锛FONTcolor=red>0]|[寮曠敤棰戞锛FONTcolor=red>0]|[闃呰娆℃暟锛FONTcolor=red>2]璐㈢粡鍓嶆部娌堥涓鎴垮缓濂[221K][涓嬭浇娆℃暟锛FONTcolor=red>36]|[缃戝垔涓嬭浇娆℃暟锛FONTcolor=red>0]|[寮曠敤棰戞锛FONTcolor=red>0]|[闃呰娆℃暟锛FONTcolor=red>2]鍥藉寤鸿涓庣ぞ浼氭不鐞/h1>[185K][涓嬭浇娆℃暟锛FONTcolor=red>22]|[缃戝垔涓嬭浇娆℃暟锛FONTcolor=red>0]|[寮曠敤棰戞锛FONTcolor=red>0]|[闃呰娆℃暟锛FONTcolor=red>2]闄堝弸鍗鏂芥棖鏃鎺㈢储褰撲唬涓浗鍝插鐨勯亾璺/h1>[153K][涓嬭浇娆℃暟锛FONTcolor=red>0]|[缃戝垔涓嬭浇娆℃暟锛FONTcolor=red>0]|[寮曠敤棰戞锛FONTcolor=red>0]|[闃呰娆℃暟锛FONTcolor=red>1]鐢版櫤蹇姹夎璇█瀛闊抽煹涓庢柟瑷€涓撻姹摱宄?绀剧淇℃伅

Coherent Accelerator Processor Interface (CAPI), is a high-speed processor expansion bus standard for use in large data center computers, initially designed to be layered on top of PCI Express, for directly connecting central processing units (CPUs) to external accelerators like graphics processing units (GPUs), ASICs, FPGAs or fast storage.[1][2] It offers low latency, high speed, direct memory access connectivity between devices of different instruction set architectures.

History

[edit]

The performance scaling traditionally associated with Moore's Law—dating back to 1965—began to taper off around 2004, as both Intel's Prescott architecture and IBM's Cell processor pushed toward a 4 GHz operating frequency. Here both projects ran into a thermal scaling wall, whereby heat extraction problems associated with further increases in operating frequency largely outweighed gains from shorter cycle times.

Over the decade that followed, few commercial CPU products exceeded 4 GHz, with the majority of performance improvements now coming from incrementally improved microarchitectures, better systems integration, and higher compute density—this largely in the form of packing a larger numbers of independent cores onto the same die, often at the expense of peak operating frequency (Intel's 24-core Xeon E7-8890 from June 2016 has a base operating frequency of just 2.2 GHz, so as to operate within the constraints of a single-socket 165 W power consumption and cooling budget).

Where large performance gains have been realized, it was often associated with increasingly specialized compute units, such as GPU units added to the processor die, or external GPU- or FPGA-based accelerators. In many applications, accelerators struggle with limitations of the interconnect's performance (bandwidth and latency) or with limitations due to the interconnect's architecture (such as lacking memory coherence). Especially in the datacenter, improving the interconnect became paramount in moving toward a heterogeneous architecture in which hardware becomes increasingly tailored to specific compute workloads.

CAPI was developed to enable computers to more easily and efficiently attach specialized accelerators. Memory intensive and computation intensive works like matrix multiplications for deep neural networks can be offloaded into CAPI-supported platforms.[3] It was designed by IBM for use in its POWER8 based systems which came to market in 2014. At the same time, IBM and several other companies founded the OpenPOWER Foundation to build an ecosystem around Power based technologies, including CAPI. In October 2016 several OpenPOWER partners formed the OpenCAPI Consortium together with GPU and CPU designer AMD and systems designers Dell EMC and Hewlett Packard Enterprise to spread the technology beyond the scope of OpenPOWER and IBM.[4]

On August 1, 2022, OpenCAPI specifications and assets were transferred to the Compute Express Link (CXL) Consortium.[5]

Implementation

[edit]

CAPI

[edit]

CAPI is implemented as a functional unit inside the CPU, called the Coherent Accelerator Processor Proxy (CAPP) with a corresponding unit on the accelerator called the Power Service Layer (PSL). The CAPP and PSL units acts like a cache directory so the attached device and the CPU can share the same coherent memory space, and the accelerator becomes an Accelerator Function Unit (AFU), a peer to other functional units integrated in the CPU.[6][7]

Since the CPU and AFU share the same memory space, low latency and high speeds can be achieved since the CPU doesn't have to do memory translations and memory shuffling between the CPU's main memory and the accelerator's memory spaces. An application can make use of the accelerator without specific device drivers as everything is enabled by a general CAPI kernel extension in the host operating system. The CPU and PSL can read and write directly to each other's memories and registers, as demanded by the application.

CAPI

[edit]

CAPI is layered on top of PCIe Gen 3, using 16 PCIe lanes, and is an additional functionality for the PCIe slots on CAPI enabled systems. Usually there are designated CAPI enabled PCIe slots on such machines. Since there is only one CAPP per POWER8 processor the number of possible CAPI units are determined by the number of POWER8 processors, regardless of how many PCIe slots there are. In certain POWER8 systems, IBM makes use of dual chip modules, thus doubling the CAPI capacity per processor socket.

Traditional transactions between a PCIe device and a CPU can take around 20,000 operations, whereas a CAPI attached device will only use around 500, significantly reducing latency, and effectively increasing bandwidth due to decreased operations overhead.[7]

The total bandwidth of a CAPI port is determined by the underlying PCIe 3.0 x16 technology, peaking at ca 16 GB/s, bidirectional.[8]

CAPI 2

[edit]

CAPI-2 is an incremental evolution of the technology introduced with IBM POWER9 processor.[8] It runs on top of PCIe Gen 4 that effectively doubles the performance to 32 GB/s. It also introduces some new features like support for DMA and Atomics from the accelerator.

OpenCAPI

[edit]

The technology behind OpenCAPI is governed by the OpenCAPI Consortium, founded in October 2016 by AMD, Google, IBM, Mellanox and Micron together with partners Nvidia, Hewlett Packard Enterprise, Dell EMC and Xilinx.[9]

OpenCAPI 3

[edit]

OpenCAPI, formerly New CAPI or CAPI 3.0, is not layered on top of PCIe and will therefore not use PCIe slots. In IBM's CPU POWER9 it will use the Bluelink 25G I/O facility that it shares with NVLink 2.0, peaking at 50 GB/s.[10] OpenCAPI doesn't need the PSL unit (required for CAPI 1 and 2) in the accelerator, as it's not layered on top of PCIe but uses its own transaction protocol.[11]

OpenCAPI 4

[edit]

Planned for future chip after the General Availability of POWER9.[12]

OMI

[edit]

OpenCAPI Memory Interface (OMI) is a serial attached RAM technology based on OpenCAPI, providing low latency, high bandwidth connection for main memory. OMI uses a controller chip on the memory modules that allows for technology agnostic approach to what is used on the modules, be it DDR4, DDR5, HBM or storage class non-volatile RAM. An OMI based CPU can therefore change RAM type by changing the memory modules.

A serial connection uses less floorspace for the interface on the CPU die therefore potentially allowing more of them compared to using common DDR memory.

OMI is implemented in IBM's Power10 CPU, which has 8 OMI memory controllers on-chip, allowing for 4 TB RAM and 410 GB/s memory bandwidth per processor. These DDIMMs (Differential Dynamic Memory Module) includes a OMI controller and memory buffer, and can address individual memory chips for fault tolerance and redundancy purposes.

Microchip Technology manufactures the OMI controller on the DDIMMs. Their SMC 1000 OpenCAPI memory is described as "the next progression in the market adopting serial attached memory."[13]

See also

[edit]

Legacy

Contemporary

References

[edit]
  1. ^ Agam Shah (17 December 2014). "IBM's new Power8 doubles performance of Watson chip". PC World. Archived from the original on 1 February 2018. Retrieved 17 December 2014.
  2. ^ "IBM Power8 Processor Detailed - Features 22nm Design With 12 Cores, 96 MB eDRAM L3 Cache and 4 GHz Clock Speed". WCCFtech. 27 August 2013. Retrieved 17 December 2014.
  3. ^ Md Syadus Sefat, Semih Aslan, Jeffrey W Kellington, Apan Qasem (2025-08-14). "Accelerating HotSpots in Deep Neural Networks on a CAPI-Based FPGA". 2019 IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC/Smart City/DSS). IEEE. pp. 248–256. doi:10.1109/HPCC/SmartCity/DSS.2019.00048. ISBN 978-1-7281-2058-4. S2CID 203656070.{{cite book}}: CS1 maint: multiple names: authors list (link)
  4. ^ OpenCAPI Unveiled: AMD, IBM, Google, Xilinx, Micron and Mellanox Join Forces in the Heterogenous Computing Era
  5. ^ CXL Consortium and OpenCAPI Consortium Sign Letter of Intent to Transfer OpenCAPI Specifications to CXL
  6. ^ Coherent Accelerator Processor Interface (CAPI) for POWER8 Systems – White Paper
  7. ^ a b Reconfigurable Accelerators for Big Data and Cloud – RAW 2016
  8. ^ a b Opening Up The Server Bus For Coherent Acceleration
  9. ^ Tech Leaders Unite to Enable New Cloud Datacenter Server Designs for Big Data, Machine Learning, Analytics, and other Emerging Workloads
  10. ^ Big Blue Aims For The Sky With Power9
  11. ^ OpenCAPI Takes on PCIe, Vows 10X Improvement
  12. ^ Stuecheli, Jeff (26 January 2017). "Webinar POWER9" (Video recording / slides). AIX Virtual User Group. - Slides (PDF) - AIX VUG page has links to slides and video
  13. ^ Patrick Kennedy (August 5, 2019), Microchip SMC 1000 For The Serial Attached Memory Future, Servethehome
[edit]
泌尿感染吃什么药最好 人为什么会焦虑 诚五行属什么 竹叶青是什么茶 5.2号是什么星座
60岁属什么 批捕意味着什么 蚂蝗是什么 碳酸钠是什么东西 6岁属什么
刚感染艾滋病什么症状 黄金有什么用 lac是什么意思 肺部有问题一般会出现什么症状 日照有什么好吃的
促甲状腺激素偏高有什么症状 荷叶和什么搭配最减肥 马来西亚属于什么国家 阴道是什么味道 胳膊上种花是什么疫苗
促甲状腺激素高是什么意思hcv9jop3ns6r.cn 看胆囊挂什么科hcv9jop2ns5r.cn 输卵管堵塞是什么原因造成的hcv9jop3ns6r.cn 做爱是什么感觉hcv9jop2ns4r.cn 排尿困难是什么原因男性hcv8jop2ns1r.cn
酒糟是什么hcv7jop6ns2r.cn 拉拉是什么意思xinjiangjialails.com 尿亚硝酸盐阳性是什么意思hcv8jop4ns4r.cn 肌酐高有什么危害hcv9jop1ns5r.cn 摸胸是什么感觉hcv8jop2ns3r.cn
生理期吃什么好hcv9jop2ns3r.cn 梦见和别人打架是什么意思hcv7jop9ns8r.cn 更年期补钙吃什么钙片好hcv8jop9ns8r.cn 黑米和什么一起搭配煮粥最佳hcv8jop5ns7r.cn 胰腺疼痛吃什么药hcv7jop5ns1r.cn
感冒嗓子哑了吃什么药hcv8jop0ns2r.cn 伤口不愈合是什么原因hcv8jop0ns1r.cn 父亲节什么时候hcv8jop5ns1r.cn 7月13日是什么节日hcv9jop5ns8r.cn 赛马不相马什么意思hcv7jop9ns1r.cn
百度