Friday, 4 September 2015

What are the advantages or drawbacks in choosing Microsoft SQL server over MySQL?

Microsoft SQL Server (version 2008 R2 & 2012)

Pros:

  1. It comes with lots of excellent tools like Sql Server Profiler, SQL Server Management Studio, BI tools and Database Tuning adviser. all will save you a lot of time in development and troubleshooting.
  2. You will find setting up almost everything exceptionally easy.
  3. Its one of Microsoft's premier products so it is well supported with ton of documentation and you will easily find help from MVPs
  4. SQL server has been evolving rapidly in multiple technologies, in Sql Server 2012 they introduced column based indexing which in a way is introduction to NoSQL in sql server.
  5. Sql server 2014 will be coming with memory optimized table (It should not be compared to MEMORY engine of MySQL)
  6. T-SQL remains consistent across new versions of sql server.
  7. Best XML support
  8. SQL Express edition is free and includes almost all functionality of full featured sql server with the limitation of supporting on 10 GB of Database.
  9. There are connection drivers for almost all platform to connect with it.
  10. These are some of the highlights but it has ton of features and list is too long. for further reading Data Center and Data Warehouse Capabilities

Cons:
  1. Its freaking costly. Its priced at $27,494 per processor + $8,592.00 for server + $164.00 for CAL (Client Access Licence)
  2. You have to be in Microsoft's Ecosystem to use sql server. meaning buying their Server products which would again be costly.
  3. You depend on Microsoft for any new feature and Microsoft generally ships updates in 2 years cycle.
  4. Due to now rapid development from Microsoft to get new and latest features you again have to make big investment.

MySQL ( I would be targeting 5.6)

Pros:
  1. Its Free. Though they have Enterprise Edition which is not free and is available at Yearly subscription. In most cases free edition is more than enough.
  2. You have lots of choice. Since MySQL is open source you can get many forks of MySQL. Among the popular is MariaDB which is Drop-in replacement for MySQL.
  3. Tooling support with Workbench is good
  4. phpMyAdmin is absolutely wonderful tool available for it.
  5. Version 5.6 came with Support of NoSQL
  6. Connection drivers available for most of the platforms
  7. Facebook, Google and Oracle are only some of the names among millions of others using MySQL
  8. Since its open source you can get bug fixed by yourself or community will help you get there.
  9. Oracle MySQL has very rapid production cycle
  10. Used in powering 50%+ Web sites in the world.
  11. Biggest community to help you for almost anything in Database

Cons:
  1. One of the Pros is Con also. Too many choice at times. hard to select at times.
  2. Extensive tooling support only available in Enterprise Edition
  3. Sql can have some in-consistency between different versions.
  4. Features are a bit less compared to SQL server
  5. Money saved in cost but sometimes errors are hard to troubleshoot due to lack of proper tools and more time consumed.
  6. MySQL has XML support but lacks lots of features to query over the xml column.

Overall, Both DBMS are great to work with. I have used both in production application and found sql server quite easy to work with in some scenarios.

Advantages and Disadvantages of views in Sql Server

What is view ?

View is the simply subset of table which are stored logically in a database  means a view is a virtual table in the database whose contents are defined by a query. 

To the database user, the view appears just like a real table, with a set of named columns and rows of data. SQL creates the illusion of the view by giving the view a name like a table name and storing the definition of the view in the database.

Views are used for security purpose in databases,views  restricts the user from viewing certain column and rows means by using view we can apply the restriction on accessing the particular rows and columns for specific user. Views display only those data which are mentioned in the query, so it shows only data which is returned by the query that is defined at the time of creation of the View

Advantages of views

Security

Each user can be given permission to access the database only through a small set of views that contain the specific data the user is authorized to see, thus restricting the user's access to stored data

Query Simplicity

A view can draw data from several different tables and present it as a single table, turning multi-table queries into single-table queries against the view.

Structural simplicity

Views can give a user a "personalized" view of the database structure, presenting the database as a set of virtual tables that make sense for that user.

Consistency
A view can present a consistent, unchanged image of the structure of the database, even if the underlying source tables are split, restructured, or renamed.

Data Integrity

If data is accessed and entered through a view, the DBMS can automatically check the data to ensure that it meets the specified integrity constraints.

Logical data independence.

 View can make the application and database tables to a certain extent independent. If there is no view, the application must be based on a table. With the view, the program can be established in view of above, to view the program with a database table to be separated.

Disadvantages of views

Performance

Views create the appearance of a table, but the DBMS must still translate queries against the view into queries against the underlying source tables. If the view is defined by a complex, multi-table query then simple queries on the views may take considerable time.

Update restrictions

When a user tries to update rows of a view, the DBMS must translate the request into an update on rows of the underlying source tables. This is possible for simple views, but more complex views are often restricted to read-only.

SQL Search Tools

Find SQL fast in SQL Server Management Studio
  • Find fragments of SQL in tables, views, stored procedures, functions, views, jobs, and more
  • Quickly navigate to objects wherever they happen to be on a server
  • Search across multiple object types and multiple databases
  • Find all references to an object
  • Search with booleans and wildcards
SQL Search is a free add-in for SQL Server Management Studio that lets you quickly search for SQL across your databases.

Download -->  SQL Search Tools



Clustered and Non-Clustered Index in SQL 2005

1. Introduction

We all know that data entered in the tables are persisted in the physical drive in the form of database files. Think about a table, say Customer (For any leading bank India), that has around 16 million records. When we try to retrieve records for two or three customers based on their customer id, all 16 million records are taken and comparison is made to get a match on the supplied customer ids. Think about how much time that will take if it is a web application and there are 25 to 30 customers that want to access their data through internet. Does the database server do 16 million x 30 searches? The answer is no because all modern databases use the concept of index.

2. What is an Index

Index is a database object, which can be created on one or more columns (16 Max column combination). When creating the index will read the column(s) and forms a relevant data structure to minimize the number of data comparisons. The index will improve the performance of data retrieval and adds some overhead on data modification such as create, delete and modify. So it depends on how much data retrieval can be performed on table versus how much of DML (Insert, Delete and Update) operations.
In this article, we will see creating the Index. The below two sections are taken from my previous article as it is required here. If your database has changes for the next two sections, you can directly go to section 5.

3. First Create Two Tables

To explain these constraints, we need two tables. First, let us create these tables. Run the below scripts to create the tables. Copy paste the code on the new Query Editor window, then execute it.
CREATE TABLE Student(StudId smallint, StudName varchar(50), Class tinyint);
CREATE TABLE TotalMarks(StudentId smallint, TotalMarks smallint);
Go
 
Note that there are no constraints at present on these tables. We will add the constraints one by one.

4. Primary Key Constraint

A table column with this constraint is called as the key column for the table. This constraint helps the table to make sure that the value is not repeated and also no null entries. We will mark the StudId column of the Student table as primary key. Follow these steps:
  1. Right click the student table and click on the modify button.
  2. From the displayed layout, select the StudId row by clicking the Small Square like button on the left side of the row.
  3. Click on the Set Primary Key toolbar button to set the StudId column as primary key column.
IndexIn2005/Pic01.JPG
Now this column does not allow null values and duplicate values. You can try inserting values to violate these conditions and see what happens. A table can have only one Primary key. Multiple columns can participate on the primary key column. Then, the uniqueness is considered among all the participant columns by combining their values.

5. Clustered Index

The primary key created for the StudId column will create a clustered index for the Studid column. A table can have only one clustered index on it.
When creating the clustered index, SQL server 2005 reads the Studid column and forms a Binary tree on it. This binary tree information is then stored separately in the disc. Expand the table Student and then expand the Indexes. You will see the following index created for you when the primary key is created:
IndexIn2005/Pic02.jpg With the use of the binary tree, now the search for the student based on the studid decreases the number of comparisons to a large amount. Let us assume that you had entered the following data in the table student:
IndexIn2005/Pic03.jpg
The index will form the below specified binary tree. Note that for a given parent, there are only one or two Childs. The left side will always have a lesser value and the right side will always have a greater value when compared to parent. The tree can be constructed in the reverse way also. That is, left side higher and right side lower.
IndexIn2005/Pic04.JPG Now let us assume that we had written a query like below:
Select * from student where studid = 103;
Select * from student where studid = 107;
Execution without index will return value for the first query after third comparison.
Execution without index will return value for the second query at eights comparison.
Execution of first query with index will return value at first comparison.
Execution of second query with index will return the value at the third comparison. Look below:
  1. Compare 107 vs 103 : Move to right node
  2. Compare 107 vs 106 : Move to right node
  3. Compare 107 vs 107 : Matched, return the record
If numbers of records are less, you cannot see a different one. Now apply this technique with a Yahoo email user accounts stored in a table called say YahooLogin. Let us assume there are 33 million users around the world that have Yahoo email id and that is stored in the YahooLogin. When a user logs in by giving the user name and password, the comparison required is 1 to 25, with the binary tree that is clustered index.
Look at the above picture and guess yourself how fast you will reach into the level 25. Without Clustered index, the comparison required is 1 to 33 millions.

The above explanation is for easy understanding. Now a days SQL server is using the B-Tree techniques to represent the clustered index.
Got the usage of Clustered index? Let us move to Non-Clustered index.


6. Non Clustered Index
A non-clustered index is useful for columns that have some repeated values. Say for example, AccountType column of a bank database may have 10 million rows. But, the distinct values of account type may be 10-15. A clustered index is automatically created when we create the primary key for the table. We need to take care of the creation of the non-clustered index.
Follow the steps below to create a Non-clustered index on our table Student based on the column class.
  1. After expanding the Student table, right click on the Indexes. And click on the New Index. IndexIn2005/Pic05.jpg
  2. From the displayed dialog, type the index name as shown below and then click on the Add button to select the column(s) that participate in the index. Make sure the Index type is Non-Clustered. IndexIn2005/Pic06.jpg
  3. In the select column dialog, place a check mark for the column class. This tells that we need a non-clustered index for the column Student.Class. You can also combine more than one column to create the Index. Once the column is selected, click on the OK button. You will return the dialog shown above with the selected column marked in blue. Our index has only one column. If you selected more than one column, using the MoveUp and MoveDown button, you can change order of the indexed columns. When you are using the combination of columns, always use the highly repeated column first and more unique columns down in the list. For example, let use assume the correct order for creating the Non-clustered index is: Class, DateOfBirth, PlaceOfBirth. IndexIn2005/Pic07.jpg
  4. Click on the Index folder on the right side and you will see the non-clustered index based on the column class is created for you. IndexIn2005/Pic08.jpg

7. How Does a Non-Clustered Index Work?

A table can have more than one Non-Clustered index. But, it should have only one clustered index that works based on the Binary tree concept. Non-Clustered column always depends on the Clustered column on the database.
This can be easily explained with the concept of a book and its index page at the end. Let us assume that you are going to a bookshop and found a big 1500 pages of C# book that says all about C#. When you glanced at the book, it has all beautiful color pages and shiny papers. But, that is not only the eligibility for a good book right? One you are impressed, you want to see your favorite topic of Regular Expressions and how it is explained in the book. What will you do? I just peeped at you from behind and recorded what you did as below:
  1. You went to the Index page (it has total 25 pages). It is already sorted and hence you easily picked up Regular Expression that comes on page Number 17.
  2. Next, you noted down the number displayed next to it which is 407, 816, 1200-1220.
  3. Your first target is Page 407. You opened a page in the middle, the page is greater than 500.
  4. Then you moved to a somewhat lower page. But it still reads 310.
  5. Then you moved to a higher page. You are very lucky you exactly got page 407. [Yes man you got it. Otherwise I need to write more. OK?]
  6. That’s all, you started exploring what is written about Regular expression on that page, keeping in mind that you need to find page 816 also.
In the above scenario, the Index page is Non-Clustered index and the page numbers are clustered index arranged in a binary tree. See how you came to the page 407 very quickly. Your mind actually traversed the binary tree way left and right to reach the page 407 quickly.
Here, the class column with distinct values 1,2,3..12 will store the clustered index columns value along with it. Say for example; Let us take only class value of 1. The Index goes like this:
1: 100, 104, 105
So here, you can easily get all the records that have value for class = 1. Map this with the Book index example now. See you all in the next article.