Spark Summit June 2014
Apache Spark and Databricks

Adoption
All major Hadoop distributions include Spark
Beyond Hadoop

Partnerships
Partner with Spark distributors to provide great experience to every Spark user
Partners

Certification
Build a strong application ecosystem
[Diagram labels: Spark API, Spark Distros …, Distros Cert, Spark Apps …, App Cert]

Certification
Free certification process
Scripts for certifying Spark distributions
• Developed by community
• Open-source
Anyone will be able to certify any Spark distribution

Training
We've been teaching Spark since 2012
• 400+ people this year through Databricks
Just launched a new training program
• Already hold workshops in 5 cities
300+ people signed up for training on Wednesday

Solve Big Data Challenges

Big Promise
Great successes using Big Data
Every organization collects data
Your company here!

Big Challenge
Great successes using Big Data
Google, Facebook spend billions $ to develop, implement, and run data analysis tools and products
Every organization collects data
Your company here!

Typical Story
Your company starts a Big Data initiative
You are tasked to…
1) Build a Hadoop cluster
2) Build a data pipeline
3) Get insights & build data products
Clusters hard to setup and manage
Need to integrate a zoo of tools
Tools are hard to use
(IT) (engineers, data scientists) (engineers, data scientists, analysts)

Typical Data Pipeline
Data, ETL, Exploration, Advanced Analytics, Dashboards & Reports, Data Products
Integrate disparate, clunky tools
Hard to navigate data, develop and deploy apps

Vision
Make big data easy

From Challenges to Solutions
Challenges
• Tools are hard to use
• Clusters hard to setup and manage
• Need to integrate a zoo of tools
Solutions
• Apache Spark
• Hosted platform
• Interactive Workspace

Databricks Cloud
[Diagram labels: Databricks Workspace, Databricks Platform, …]

Databricks Platform
Start clusters in seconds
Zero-cost management
Dynamically scale up & down

Apache Spark
Unifies
• Streaming
• SQL
• Machine learning
• Graphs
Single system, single API

Databricks Workspace
Dashboards, Notebooks, Jobs, Apps

Notebooks
Support Python, SQL, Scala
Interactive commands & plots
On-line collaboration

Dashboards
WYSIWYG builder
Interactive plots
One-click publishing

Job Launcher
Run arbitrary Spark jobs, programmatically

Dramatically Simplify Data Pipeline
Data, ETL, Exploration, Advanced Analytics, Dashboards & Reports, Data Products
Cloud
Free users to focus on finding answers & building products

Demo

Availability
Started closed beta program earlier this year
Limited availability soon
• Gradually ramping up
• Sign up on databricks.com!

3rd Party Apps
[Diagram labels: 3rd Party Apps, Databricks Workspace, Databricks Platform, …, Apps]

Databricks Cloud and Spark
Databricks Cloud runs 100% Apache Spark
• No lock in: any Databricks Cloud app runs on any certified Spark distribution
Databricks Cloud accelerates Spark adoption
• Provide easiest way to learn and use Apache Spark

Databricks Cloud
[Diagram labels: Databricks Platform, Databricks Workspace]
Make big data easy
Dramatically simplify
• analyzing big data
• building data products
Fuel growth of Spark ecosystem

Thank You!
Title: 7. Apache Spark and Databricks
Link: https://www.777doc.com/doc-4870007 .html