Skip to main content

Posts

Random Forest Introduction

Random Forest Introduction Random forest is one of popular algorithm which is used for classification and regression as an ensemble learning. It means random forest includes multiple decision trees which the average of the result of each decision tree would be the final outcome for random forest. There are some drawbacks in decision tree such as over fitting on training set which causes high variance, although it was solved in random forest by the aid of Bagging (Bootstrap Aggregating). Now firstly it is better to pay attention to decision tree algorithm and then study about random forest. Because random forest is divided to multitude decision tree. Decision Tree: Decision tree uses tree-like graph to take as best as possible decision by considering all elements of graph. For instance, remember tennis player who has agenda to play in different weather conditions. And now we want to know if player will play on 15th day or not? Finding Pure Branch There are 15 days which in
Recent posts

Connecting SQL Server on Azure VM

Steps for configuring SQL Server connectivity in an Azure VM The connection path is summarized by the following diagram: Create a TCP endpoint for the virtual machine In order to access SQL Server from the internet, the virtual machine must have an endpoint to listen for incoming TCP communication. This Azure configuration step, directs incoming TCP port traffic to a TCP port that is accessible to the virtual machine. Note If you are connecting within the same cloud service or virtual network, you do not have to create a publically accessible endpoint. In that case, you could continue to the next step. For more information, see   Connection Scenarios . 1.       On the Azure Portal, select   Virtual machines (classic) . 2.       Then select you SQL Server virtual machine. 3.      Select   Endpoints , and then click the   Add   button at the top of the Endpoints blade. 4.       On the   Add Endpoint   blade, provide a   Name   such as SQLEndpoint. 5.      

Shrink all SQL Server databases transaction log files

PROBLEM: You want to shrink all SQL Server databases  transaction   log files to clean up some space SOLUTION: Use the following script to shrink all databases transaction log files. You can use it also as a maintenance job using SQL Server Agent DECLARE   @Log_name   VARCHAR ( 1000 ) DECLARE   @Db_name   VARCHAR ( 1000 ) DECLARE   @Recovery_model_desc   VARCHAR ( 1000 ) DECLARE   @SQL   nvarchar ( 2000 ) DECLARE   @ParmDefinition   nvarchar ( 1000 ) DECLARE   @SizeAfter   int DECLARE   db_cursor   CURSOR   FOR SELECT         F . NAME   AS   [LOG_NAME] ,         DB . NAME   AS   [DB_NAME] ,         DB . RECOVERY_MODEL_DESC   AS   [RECOVERY_MODEL_DESC] FROM         MASTER . SYS . MASTER_FILES   F   INNER   JOIN   MASTER . SYS . DATABASES   DB         ON   DB . DATABASE_ID   =   F . DATABASE_ID WHERE   F . FILE_ID = 2  AND   DB . NAME   <>   'tempdb' OPEN   db_cursor           FETCH   NEXT   FROM   db_cursor   INTO   @Log_name ,   @Db_name

SQL Code Review-Tips

General Standards: (Code Format, Naming Conventions, Datatype and Data Length, Syntax): §   Always follow a template in designing stored procedure so that it can easier developer job while designing and integrating. For example each stored procedure should be defined as various blocks such as “Comments Section”, “Variable Declaration”, “Actual Body”, “Audit”, ”Exception Handling”, “Temp_Obj_Removel” and define environment sections if any required. §   Check proper comments are used or not. Always describe procedure, inputs and expected output in comments section. §   Check naming conventions are used properly for procedure name, variables and other internal objects. §   Check all objects used inside the procedure are prefixed with the schema name and column names are referencing with table alias. §   Check all table columns used / mapped are using the correct datatypes and column length. §   Check if all required SET based options enabled are not. §   Check if there ar