MSBI (SSIS/SSRS/SSAS) Online Training

Thursday, October 25, 2012

MCITP 70-452 Passed

Hi All,

I have completed MCITP 70-452 certification successfully.


Exam Profile :
MCITP 70-452: Designing a Business Intelligence Infrastructure Using Microsoft SQL Server 2008
http://www.microsoft.com/learning/en/us/exam.aspx?id=70-452#tab1



MSBI (SSIS/SSRS/SSAS) Online Training:



Regards,

Ravindra

Sunday, October 21, 2012

MS : SSAS - Best Practices – Cube & MDX





Cube Design Best Practices - Dimensions


• Consolidate multiple hierarchies into single dimension (unless they are related via fact table)

• Avoid ROLAP storage mode

• Use role playing dimensions (e.g. OrderDate, BillDate, ShipDate) - avoids multiple physical copies

• Use parent-child dimensions prudently

o No aggregation support

• Set Materialized = true on reference dimensions

• Use many-to-many dimensions prudently

o Slower than regular dimensions, but faster than calculations

o Intermediate measure group must be “small” relative to primary measure group



Cube Design Best Practices – Attributes/Hierarchies

• Define all possible attribute relationships!

• Remove redundant attribute relationships

• Mark attribute relationships as rigid where appropriate

• Use integer (or numeric) key columns

• Set AttributeHierarchyEnabled to false for attributes not used for navigation (e.g. Phone#, Address)

• Set AttributeHierarchyOptimizedState to NotOptimized for infrequently used attributes

• Set AttributeHierarchyOrdered to false if the order of members returned by queries is not important

• Use natural hierarchies where possible



Cube Design Best Practices – Measures

• Use smallest numeric data type possible

• Use semi-additive aggregate functions instead of MDX calculations to achieve same behavior

• Put distinct count measures into separate measure group (BIDS does this automatically)

• Avoid string source column for distinct count measures



Cube Design Best Practices – OLAP Partitions

• No more than 20M rows per partition

• Specify partition slice

o Optional for MOLAP – server auto-detects the slice and validates against user specified slice (if any)

o Must be specified for ROLAP

• Manage storage settings by usage patterns

o Frequently queried  MOLAP with lots of aggregations

o Periodically queried  MOLAP with less or no aggregations

o Historical  ROLAP with no aggregations

• Alternate disk drive - use multiple controllers to avoid I/O contention

• Remote partitions for scale out – VLDB



Cube Design Best Practices – Aggregations

• Define all possible attribute relationships

• Set accurate attribute member counts and fact table counts

• Set AggregationUsage to guide aggregation designer

o Set rarely queried attributes to None

o Set commonly queried attributes to Unrestricted

• Do not build too many aggregations

o In the 100s, not 1000s

• Do not build aggregations larger than 30% of fact table size (agg design algorithm doesn’t)



MDX Query Design Best Practices

• Use calculated members instead of calc cells where possible

• Use .MemberValue for calculations on numeric attributes

o Filter(Customer.members, Salary.MemberValue > 100000)

• Avoid using CalculationPassValue

o Rely on auto recursion resolution using scopes and assignments

• Avoid redundant use of .CurrentMember and .Value

o (Time.CurrentMember.PrevMember, Measures.CurrentMember ).Value can be replaced with Time.PrevMember

• Avoid LinkMember, StrToSet, StrToMember, StrToValue

• Replace simple calculations with computed columns in DSV

o Calculation done at processing time is always better

• Many more at:

o http://sqljunkies.com/weblog/mosha

o http://sqlserveranalysisservices.com



Source:

http://blogs.msdn.com/b/jbasilico/archive/2008/04/01/analysis-services-2005-design-best-practices-cubes-mdx.aspx



MSBI (SSIS/SSRS/SSAS) Online Training:




Monday, October 8, 2012

CDC in SQL Server 2008/2012

Source tables change over time. A data mart or data warehouse that is based on those tables needs to reflect these changes. However, a process that periodically copies a snapshot of the entire source consumes too much time and resources. Alternate approaches that include timestamp columns, triggers, or complex queries often hurt performance and increase complexity. What is needed is a reliable stream of change data that is structured so that it can easily be applied by consumers to target representations of the data. Change data capture in SQL Server provides this solution.




The change data capture feature of the Database Engine captures insert, update, and delete activity applied to SQL Server tables, and makes the details of the changes available in an easily-consumed, relational format. The change tables used by change data capture contain columns that mirror the column structure of the tracked source tables, along with the metadata needed to understand the changes that have occurred on a row by row basis.

Source:

http://msdn.microsoft.com/en-us/library/bb895315.aspx

http://www.mattmasson.com/index.php/2011/12/cdc-in-ssis-for-sql-server-2012-2/



MSBI (SSIS/SSRS/SSAS) Online Training:



Best Regards,

Ravindra

sql server 2012 new features



Quite simply, SQL Server 2012 is going to revolutionize the way we think about availability and recovery. Today, when we think about these concepts, we consider an individual database (log shipping, replication, mirroring) or an entire SQL Server instance (failover clustering). In SQL Server 2012, in addition to Failover Clustering enhancements, we’ll be able to treat a group of databases as an entity – a much more common requirement with today’s complex, multi-database applications.


You may download fee ebook from below Microsoft site:

http://download.microsoft.com/download/F/F/6/FF62CAE0-CE38-4228-9025-FBF729312698/Microsoft_Press_eBook_Introducing_Microsoft_SQL_Server_2012_PDF.pdf



Microsoft has introduced SQL Server 2012 to the world and it's time for IT professionals to start to come to speed on what's new in this highly anticipated version of SQL Server.

1. AlwaysOn Availability Groups -- This feature takes database mirroring to a whole new level. With AlwaysOn, users will be able to fail over multiple databases in groups instead of individually. Also, secondary copies will be readable, and can be used for database backups. The big win is that your DR environment no longer needs to sit idle.

2. Windows Server Core Support -- If you don't know what Windows Server Core is, you may want to come up to speed before Windows 8 (MS is making a push back to the command line for server products). Core is the GUI-less version of Windows that uses DOS and PowerShell for user interaction. It has a much lower footprint (50% less memory and disk space utilization), requires fewer patches, and is more secure than the full install. Starting with SQL 2012, it is supported for SQL Server.

3. Columnstore Indexes -- This a cool new feature that is completely unique to SQL Server. They are special type of read-only index designed to be use with Data Warehouse queries. Basically, data is grouped and stored in a flat, compressed column index, greatly reducing I/O and memory utilization on large queries.

4. User-Defined Server Roles -- DBAs have always had the ability to create custom database role, but never server wide. For example, if the DBA wanted to give a development team read/write access to every database on a shared server, traditionally the only ways to do it were either manually, or using undocumented procedures. Neither of which were good solutions. Now, the DBA can create a role, which has read/write access on every DB on the server, or any other custom server wide role.

5. Enhanced Auditing Features -- Audit is now available in all editions of SQL Server. Additionally, users can define custom audit specifications to write custom events into the audit log. New filtering features give greater flexibility in choosing which events to write to the log.

6. BI Semantic Model -- This is replacing the Analysis Services Unified Dimensional Model (or cubes most people referred to them). It's a hybrid model that allows one data model will support all BI experiences in SQL Server. Additionally, this will allow for some really neat text infographics

7. Sequence Objects -- For those folks who have worked with Oracle, this has been a long requested feature. A sequence is just an object that is a counter -- a good example of it's use would be to increment values in a table, based a trigger. SQL has always had similar functionality with identity columns, but now this is a discrete object.

8. Enhanced PowerShell Support -- Windows and SQL Server admins should definitely start brushing up on their PowerShell scripting skills. Microsoft is driving a lot of development effort into instrumenting all of their server-based products with PowerShell. SQL 2008 gave DBAs some exposure to it, but there are many more in cmdlets in SQL 2012.

9. Distributed Replay -- Once again this is answer to a feature that Oracle released (Real Application Testing). However, and in my opinion where the real value proposition of SQL Server is, in Oracle it is a (very expensive) cost option to Enterprise Edition. With SQL, when you buy your licenses for Enterprise Edition, you get everything. Distributed replay allows you to capture a workload on a production server, and replay it on another machine. This way changes in underlying schemas, support packs, or hardware changes can be tested under production conditions.

10. PowerView -- You may have heard of this under the name "Project Crescent" it is a fairly powerful self-service BI toolkit that allows users to create mash ups of BI reports from all over the Enterprise.

11. SQL Azure Enhancements -- These don't really go directly with the release of SQL 2012, but Microsoft is making some key enhancements to SQL Azure. Reporting Services for Azure will be available, along with backup to the Windows Azure data store, which is a huge enhancement. The maximum size of an Azure database is now up to 150G. Also Azure data sync allows a better hybrid model of cloud and on-premise solutions

12. Big Data Support -- I saved the biggest for last, introduced at the PASS (Professional Association for SQL Server) conference last year, Microsoft announced a partnership with Hadoop provider Cloudera. One part of this involves MS releasing a ODBC driver for SQL Server that will run on a Linux platform. Additionally, Microsoft is building connectors for Hadoop, which is an extremely popular NoSQL platform. With this announcement, Microsoft has made a clear move into this very rapidly growing space.

Source:


http://mcpmag.com/articles/2012/03/14/top-12-features-of-sql-server-2012.aspx



MSBI (SSIS/SSRS/SSAS) Online Training:



Best Regards,

Ravindra