SAN Interview questions





1. WHAT ARE THE BENEFITS OF FIBRE CHANNEL SANS?


Fibre Channel SANs are the de facto standard for storage networking in the corporate data center because they provide exceptional reliability, scalability, consolidation, and performance. Fibre Channel SANs provide significant advantages over direct-attached storage through improved storage utilization, higher data availability, reduced management costs, and highly scalable capacity and performance.


2. WHAT ENVIRONMENT IS MOST SUITABLE FOR FIBRE CHANNEL SANS?




Typically, Fibre Channel SANs are most suitable for large data centers running business-critical data, as well as applications that require high-bandwidth performance such as medical imaging, streaming media, and large databases. Fibre Channel SAN solutions can easily scale to meet the most demanding performance and availability requirements.



3. WHAT CUSTOMER PROBLEMS DO FIBRE CHANNEL SANS SOLVE?



The increased performance of Fibre Channel enables a highly effective backup and recovery approach, including LAN-free and server-free backup models. The result is a faster, more scalable, and more reliable backup and recovery solution. By providing flexible connectivity options and resource sharing, Fibre Channel SANs also greatly reduce the number of physical devices and disparate systems that must be purchased and managed, which can dramatically lower capital expenditures. Heterogeneous SAN management provides a single point of control for all devices on the SAN, lowering costs and freeing personnel to do other tasks.



4. HOW LONG HAS FIBRE CHANNEL BEEN AROUND?



Development started in 1988, ANSI standard approval occurred in 1994, and large deployments began in 1998. Fibre Channel is a mature, safe, and widely deployed solution for high-speed (1 Gb/s, 2 Gb/s, 4 Gb/s) communications and is the foundation for the majority of SAN installations throughout the world.



5. WHAT IS THE FUTURE OF FIBRE CHANNEL SANS?



Fibre Channel is a well-established, widely deployed technology with a proven track record and a very large installed base, particularly in high-performance, business-critical data center environments. Fibre Channel SANs continue to grow and will be enhanced for a long time to come. The reduced costs of Fibre Channel components, the availability of SAN kits, and the next generation of Fibre Channel (4 Gb/s) are helping to fuel that growth. In addition, the Fibre Channel roadmap includes plans to double performance every three years.



6. WHAT ARE THE BENEFITS OF 4GB FIBRE CHANNEL?



Benefits include twice the performance with little or no price increase, investment protection with backward compatibility to 2 Gb/s, higher reliability due to fewer SAN components (switch and HBA ports) required, and the ability to replicate, back up, and restore data more quickly. 4 Gb/s Fibre Channel systems are ideally suited for applications that need to quickly transfer large amounts of data such as remote replication across a SAN, streaming video on demand, modeling and rendering, and large databases. 4 Gb/s technology is shipping today.



7. HOW IS FIBRE CHANNEL DIFFERENT FROM ISCSI?



Fibre Channel and iSCSI each have a distinct place in the IT infrastructure as SAN alternatives to DAS. Fibre Channel generally provides high performance and high availability for business-critical applications, usually in the corporate data center. In contrast, iSCSI is generally used to provide SANs for business applications in smaller regional or departmental data centers.



8. WHEN SHOULD I DEPLOY FIBRE CHANNEL INSTEAD OF ISCSI?



For environments consisting of high-end servers that require high bandwidth or data center environments with business-critical data, Fibre Channel is a better fit than iSCSI. For environments consisting of many midrange or low-end servers, an IP SAN solution often delivers the most appropriate price/performance.



9. Name some of the SAN topologies


Point-to-point, arbitrated loop, and switched fabric topologies




10. Why is a separate network needed for storage? Why can't the LAN be used?


LAN hardware and operating systems are geared to user traffic, and LANs are tuned for a fast user response to messaging requests.
With a SAN, the storage units can be secured separately from the servers and kept entirely apart from the user network. This improves block-level storage access (bulk data transfers), which is advantageous for server-less backups.



11. What are the advantages of RAID?


“Redundant Array of Inexpensive Disks”
Depending on how the array is configured, RAID can provide mirroring, striping, and parity checking. These can be used in combination to deliver the balance of performance and reliability that the user requires.



12. Define RAID. Which level do you feel is a good choice?



RAID (Redundant Array of Independent Disks) is a technology for achieving redundancy with faster I/O. There are many RAID levels to meet different customer needs: R0, R1, R3, R4, R5, R6, and R10.
Customers generally choose R5 because it offers a cost-effective balance of redundancy and speed.
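
The cost trade-off between levels can be illustrated with a rough usable-capacity calculation. This is a minimal sketch, assuming equal-sized disks and the textbook capacity rules for each level; the function name usable_capacity and the sample disk sizes are made up for illustration.

```python
def usable_capacity(level: str, disks: int, disk_size_gb: float) -> float:
    """Return usable capacity in GB for a few common RAID levels.

    Assumes all disks are the same size (illustrative only).
    """
    if level == "R0":        # striping only, no redundancy
        minimum, data_disks = 2, disks
    elif level == "R1":      # every disk holds a full copy
        minimum, data_disks = 2, 1
    elif level == "R5":      # one disk's worth of parity
        minimum, data_disks = 3, disks - 1
    elif level == "R6":      # two disks' worth of parity
        minimum, data_disks = 4, disks - 2
    elif level == "R10":     # striped mirrors, half the disks hold copies
        minimum, data_disks = 4, disks // 2
        if disks % 2:
            raise ValueError("R10 needs an even number of disks")
    else:
        raise ValueError(f"unsupported level: {level}")
    if disks < minimum:
        raise ValueError(f"{level} needs at least {minimum} disks")
    return data_disks * disk_size_gb


# Four hypothetical 500 GB disks: R5 keeps more usable space than R10.
r5_capacity = usable_capacity("R5", 4, 500)    # 1500 GB
r10_capacity = usable_capacity("R10", 4, 500)  # 1000 GB
```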




13. What is the difference between RAID 0+1 and RAID 1+0?






14. When are JBODs used?


“Just a Bunch of Disks”
It is a collection of disks that share a common connection to the server but do not include the mirroring, striping, or parity facilities that RAID systems provide. These capabilities are available through host-based software.



15. Differentiate between RAID and JBOD.



RAID: “Redundant Array of Inexpensive Disks”
Fault-tolerant grouping of disks that the server sees as a single disk volume
Combination of parity checking, mirroring, and striping
Self-contained, manageable unit of storage

JBOD: “Just a Bunch of Disks”
Drives independently attached to the I/O channel
Scalable, but requires the server to manage multiple volumes
Does not provide protection in case of drive failure



16. What is an HBA?


Host bus adapters (HBAs) are needed to connect the server (host) to the storage.



17. What are the advantages of SAN?





18. What is the difference between SAN and NAS?



The basic difference is that a SAN is fabric based, whereas NAS is Ethernet based.
SAN - Storage Area Network
  It accesses data at the block level and presents space to the host in the form of a disk.

NAS - Network Attached Storage
  It accesses data at the file level and presents space to the host in the form of a shared network folder.



19. What does a typical storage area network consist of, if we consider implementing it in a small business setup?


For a small business, the following are the essential components of a SAN:



20. Can you briefly explain each of these storage area network components?





21. What is the most critical component in SAN?


Each component has its own criticality with respect to business needs of a company.



22. How is a SAN managed?



Many management software packages are used for managing SANs; to name a few:



23. What is the default ID for a SCSI HBA?



Generally, the default ID for a SCSI HBA is 7.
    SCSI - Small Computer System Interface
    HBA - Host Bus Adapter



24. What is the highest and lowest priority of SCSI?


There are 16 different IDs that can be assigned to SCSI devices. In priority order, they are 7, 6, 5, 4, 3, 2, 1, 0, 15, 14, 13, 12, 11, 10, 9, 8.
The highest-priority SCSI ID is 7 and the lowest-priority ID is 8.
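
The priority order above can be expressed as a small lookup. This is an illustrative sketch only: the helper name arbitration_winner is hypothetical, and it simply picks whichever contending ID appears earliest in the priority list.

```python
# Priority order for the 16 SCSI IDs, highest priority first.
SCSI_PRIORITY_ORDER = [7, 6, 5, 4, 3, 2, 1, 0, 15, 14, 13, 12, 11, 10, 9, 8]


def arbitration_winner(contending_ids):
    """Return the ID that wins when several devices contend for the bus."""
    return min(contending_ids, key=SCSI_PRIORITY_ORDER.index)


# ID 3 beats ID 15 even though 15 is numerically larger.
winner = arbitration_winner([3, 15, 0])
```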



25. How do you install device drivers for the HBA during OS installation?



In some scenarios you need to install the operating system on drives connected through a SCSI HBA or SCSI RAID controller, but most operating system installers do not include drivers for those controllers, so you must supply them externally. If you are installing Windows, press F6 during the installation and provide the driver disk or CD that came with the HBA.
If you are installing Linux, type "linux dd" at the installer boot prompt to load a driver disk.



26. What is an array?



An array is a group of independent physical disks used to configure volumes or RAID volumes.



27. Can you describe in detail at least 3 troubleshooting scenarios that you have come across?



SCENARIO 1: How do you find and debug errors when working with SCSI devices?

In daily SAN troubleshooting, we use many management and configuration tools to see when a target device or an initiator device has failed.
Some problems are hard to troubleshoot, such as media errors in the drives or drives that take a long time to spin up. In such cases these utilities do not help. To capture this kind of information, most controllers provide a 3-pin serial debug port. With a serial debug cable you can collect the debug output using HyperTerminal software.

SCENARIO 2: My controller takes a long time to boot and detect all the connected drives. How can I solve this?

Several things might cause this problem. One reason might be bad drives that cannot be repaired. In that case, replace the disks with working ones.

Another reason might be that the controller is connected to a slot that is not supported.
Try connecting it to a different type of slot.

One more probable reason is that firmware for different OEMs has been flashed on the same hardware.
To fix this, flash utilities usually have an option to erase the previous firmware along with the EEPROM and boot block entries. Use that option to rectify the problem.

SCENARIO 3: I am using a 700X series tape drive, and the vendor information on the drive also says 700X, but the POST information while booting the server shows 500X. What could be the problem?

First, confirm which series your hardware belongs to; you can find this on the product website.
This usually happens because many testing companies use the same hardware to test different series of the same hardware type by flashing firmware for the different series. You can always flash back to the firmware that matches the exact hardware type.



28. What are the SAN topologies?



A SAN can be connected in 3 ways: point-to-point, arbitrated loop, and switched fabric.


29. What are the 4 types of SAN architecture?


  1. Core-edge
  2. Full-Mesh
  3. Partial-Mesh
  4. Cascade



30. Which command is used in Linux to find the driver version of a hardware device?


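dmesg (the kernel log, where drivers typically report their version when they load). The modinfo command, followed by the driver module name, also reports the module's version field.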



31. What is the minimum number of drives required to create R5 (RAID 5)?


You need to have at least 3 disk drives to create R5.
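
The three-drive minimum follows from how parity works: at least two data blocks plus one parity block per stripe. The sketch below, with made-up block contents and a hypothetical helper xor_blocks, shows how XOR parity lets a lost block be rebuilt from the survivors. It illustrates the principle only, not any particular controller's implementation.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# Two data blocks on two drives, parity on a third (sample data only).
d0 = b"\x0f\xf0\xaa"
d1 = b"\x01\x02\x03"
parity = xor_blocks([d0, d1])

# If the drive holding d1 fails, XOR the survivors to rebuild it.
rebuilt_d1 = xor_blocks([d0, parity])
```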



32. Can you name some of the states of a RAID array?



RAID arrays have states that represent their status, which are given below:


33. Name the features of SCSI-3 standard?


QAS: Quick arbitration and selection
Domain Validation
CRC: Cyclic redundancy check



34. Can we assign a hot spare to R0 (RAID 0) array?


No. Since R0 is not a redundant array, the failure of any disk results in the failure of the entire array, so there is nothing from which to rebuild onto a hot spare.



35. Can you name some of the available tape media types?



Many types of tape media are available for backing up data; some of them are:



36. Briefly list the advantages of SAN?


SANs fully exploit high-performance, high connectivity network technologies
SANs expand easily to keep pace with fast growing storage needs
SANs allow any server to access any data
SANs help centralize management of storage resources
SANs reduce total cost of ownership (TCO).



37. What is HA?



HA (High Availability) is a technology for achieving failover with very low latency. It is a practical requirement of data centers these days, when customers expect servers to be running 24 hours a day, 7 days a week, 365 days a year, usually referred to as 24x7x365. To achieve this, a redundant infrastructure is created so that if a database server or an application server fails, a replica database or application server is ready to take over operations. With an HA network infrastructure, the end customer should not experience an outage.



38. What is virtualization?



Virtualization is the logical representation of physical devices. It is the technique of managing and presenting storage devices and resources functionally, regardless of their physical layout or location. Virtualization is the pooling of physical storage from multiple network storage devices into what appears to be a single storage device that is managed from a central console. Storage virtualization is commonly used in a storage area network (SAN). The management of storage devices can be tedious and time-consuming. Storage virtualization helps the storage administrator perform the tasks of backup, archiving, and recovery more easily, and in less time, by disguising the actual complexity of the SAN.



39. Briefly describe the composition of an FC frame.





40. What is storage virtualization?


Storage virtualization is the amalgamation of multiple network storage devices into a single storage unit.



41. What are the protocols used in physical/datalink and network layer of SAN?


  1. Ethernet
  2. SCSI
  3. Fibre Channel



42. What are the types of disk array used in SAN?


  1. JBOD
  2. RAID



43. What are the different types of protocols used in the transport and session layers of a SAN?


  1. Fibre Channel Protocol (FCP)
  2. Internet SCSI (iSCSI)
  3. Fibre Channel over IP (FCIP)



44. What is the type of Encoding used in Fibre Channel?


8b/10b, because this encoding technique is able to detect almost all bit errors.
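
One practical consequence of 8b/10b encoding is its overhead: every 8 data bits travel as 10 line bits. The sketch below works out the resulting byte rate for a given line rate. The function name is hypothetical, the 1.0625 Gbaud figure is the line rate of 1 Gb/s Fibre Channel, and framing overhead is deliberately ignored.

```python
def payload_rate_mb_per_s(line_rate_gbaud: float) -> float:
    """Byte rate carried by an 8b/10b link, before framing overhead.

    Each byte (8 data bits) is sent as a 10-bit transmission character,
    so the byte rate is the line rate divided by 10.
    """
    line_bits_per_second = line_rate_gbaud * 1e9
    return line_bits_per_second / 10 / 1e6


# 1.0625 Gbaud on the wire carries 106.25 MB/s of encoded data bytes.
one_gig_fc = payload_rate_mb_per_s(1.0625)
```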



45. How many classes of service are available in Fibre Channel?


7 Classes of service are available in Fibre Channel



46. What are the main constraints of SCSI in storage networking?


  1. Deployment distance (max. of 25 m)
  2. Number of devices that can be interconnected (16)



47. What is a Fabric?


Interconnection of Fibre Channel Switches



48. What are the services provided by Fabric to all the nodes?


  1. Fabric Login
  2. Simple Name Server (SNS)
  3. Fabric Address Notification
  4. Registered state change notification
  5. Broadcast Servers



49. What is the difference between LUN and WWN?





50. What are the different topologies in Fibre Channel?


  1. Point-to-Point
  2. Arbitrated Loop
  3. Switched Fabric



51. What are the layers of Fibre Channel Protocol?


  1. FC-0: Physical Media
  2. FC-1: Encoder and Decoder
  3. FC-2: Framing and Flow Control
  4. FC-3: Common Services
  5. FC-4: Upper Level Protocol Mapping



52. What is zoning?


Zoning is a fabric management service that can be used to create logical subsets of devices within a SAN. This enables partitioning of resources for management and access control purposes.
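
The idea of logical subsets can be modeled as named sets of members. This is a conceptual sketch only, not any switch vendor's configuration syntax: the zone names, the WWPN values, and the helper can_communicate are all invented for illustration.

```python
# Hypothetical zone database: zone name -> set of member WWPNs.
zones = {
    "zone_db": {"10:00:00:00:c9:aa:aa:01", "50:06:01:60:bb:bb:00:01"},
    "zone_backup": {"10:00:00:00:c9:aa:aa:02", "50:06:01:60:bb:bb:00:02"},
}


def can_communicate(wwpn_a, wwpn_b, zone_db):
    """Two devices may talk only if at least one zone contains both."""
    return any(wwpn_a in members and wwpn_b in members
               for members in zone_db.values())


# The database host can reach its own array port, but not the backup target.
same_zone = can_communicate("10:00:00:00:c9:aa:aa:01",
                            "50:06:01:60:bb:bb:00:01", zones)
cross_zone = can_communicate("10:00:00:00:c9:aa:aa:01",
                             "50:06:01:60:bb:bb:00:02", zones)
```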



53. What are the two major classifications of zoning?


The two types of zoning are:
  1. Software Zoning
  2. Hardware Zoning



54. What are different levels of zoning?


  1. Port Level zoning
  2. WWN Level zoning
  3. Device Level zoning
  4. Protocol Level zoning
  5. LUN Level zoning



55. What are the 3 prominent characteristics of the SATA protocol?


  1. Native Command Queuing (NCQ)
  2. Port Multiplier
  3. Port Selector



56. What are the 5 states of an Arbitrated Loop in FC?


  1. Loop Initialization
  2. Loop Monitoring
  3. Loop arbitration
  4. Open Loop
  5. Close Loop



57. How does an FC switch maintain addresses?


An FC switch uses the Simple Name Server (SNS) to maintain the address mapping table.



58. What is the purpose of a disk array?


A disk array is designed so that the data stored on it does not become unavailable because of a single point of failure.




59. What is a disk array?


A set of high-performance storage disks that can store several terabytes of data. A single disk array can support multiple points of connection to the network.



60. What is virtualization?


A technique for hiding the physical characteristics of computing resources from the way in which other systems, applications, or end users interact with those resources. It includes the aggregation, spanning, or concatenation of multiple resources into larger resource pools.



61. What is Multipath I/O?


A fault-tolerance technique in which there is more than one physical path between the CPU in a computer system and its storage devices, through the buses, controllers, switches, and bridge devices connecting them.
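
The failover behavior this implies can be sketched in a few lines. This is a simplified model, assuming each path is just a label with a health flag; the path names and the helper select_path are hypothetical, and real multipath drivers also handle load balancing and path recovery.

```python
def select_path(paths):
    """Return the first healthy path, in configured priority order."""
    for name, healthy in paths:
        if healthy:
            return name
    raise RuntimeError("all paths to the storage device are down")


# The primary path has failed, so I/O falls over to the second one.
paths = [
    ("hba0 -> switchA -> ctrl0", False),
    ("hba1 -> switchB -> ctrl1", True),
]
active = select_path(paths)
```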



62. What is RAID?


A technology that groups several physical drives in a computer into an array that you can define as one or more logical drives. Each logical drive appears to the operating system as a single drive. This grouping enhances the performance of the logical drive beyond the physical capability of the individual drives.



63. What is stripe-unit-size?


It is a data distribution scheme that complements the way the operating system requests data. The stripe unit size is the granularity at which data is stored on one drive of the array before subsequent data is stored on the next drive of the array. The stripe unit size should be close to the size of the system I/O request.
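
The granularity described above can be made concrete with a small address calculation. This is a minimal sketch for plain striping without parity; the function name locate_block and the sample sizes are assumptions for illustration.

```python
def locate_block(logical_block, stripe_unit_blocks, num_drives):
    """Map a logical block number to (drive index, block offset on that drive).

    Data fills one stripe unit on a drive, then moves to the next drive,
    wrapping around to the first drive after the last one.
    """
    stripe_unit_index = logical_block // stripe_unit_blocks
    drive = stripe_unit_index % num_drives
    stripe_row = stripe_unit_index // num_drives
    offset = stripe_row * stripe_unit_blocks + logical_block % stripe_unit_blocks
    return drive, offset


# With a 4-block stripe unit on 3 drives, block 4 is the first block on drive 1,
# and block 13 wraps around to drive 0 at offset 5.
first_on_second_drive = locate_block(4, 4, 3)
wrapped = locate_block(13, 4, 3)
```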



64. What is LUN Masking?


A method used to create an exclusive storage area and to control access to it. It can be achieved by the storage device's control program.
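
Conceptually, LUN masking is an access table kept by the storage device. The sketch below models it as a mapping from initiator to permitted LUNs; the WWPN values, the table layout, and the helper visible_luns are invented for illustration and do not reflect any specific array's interface.

```python
# Hypothetical masking table: initiator WWPN -> LUNs it is allowed to see.
lun_masks = {
    "10:00:00:00:c9:aa:aa:01": {0, 1},
    "10:00:00:00:c9:aa:aa:02": {2},
}


def visible_luns(initiator_wwpn, masks):
    """Return the sorted LUNs exposed to an initiator (none if unlisted)."""
    return sorted(masks.get(initiator_wwpn, set()))


host_a_luns = visible_luns("10:00:00:00:c9:aa:aa:01", lun_masks)
unknown_host_luns = visible_luns("10:00:00:00:c9:ff:ff:ff", lun_masks)
```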



65. What is the smallest unit of information transfer in FC?


Frame



66. How is the capacity of the HDD calculated?


Number of Heads X Number of Cylinders X Sectors per track X Sector Size
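
As a worked example of the formula, the sketch below multiplies the four factors together. The function name is hypothetical, the default 512-byte sector size is an assumption, and the sample geometry (16 heads, 16383 cylinders, 63 sectors per track) is the classic reported CHS geometry rather than a real drive's physical layout.

```python
def chs_capacity_bytes(heads, cylinders, sectors_per_track, sector_size=512):
    """Capacity = heads x cylinders x sectors per track x sector size."""
    return heads * cylinders * sectors_per_track * sector_size


# 16 x 16383 x 63 x 512 bytes, roughly 8.4 GB.
capacity = chs_capacity_bytes(16, 16383, 63)
```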



67. What is bad block reallocation?


A bad sector is remapped (reallocated) to a good spare block, and this information is stored in an internal table on the hard disk drive. Bad blocks are identified during the media test of the HDD as well as during the read and write operations performed in I/O tests. In addition, newer generations of HDDs come with a technology called BGMS (background media scan), which continuously scans the HDD media for defects and remaps them while the drive is idle (this is performed after the HDD is attached to the system).



68. What are two types of recording techniques on the tapes?


  1. Linear Recording
  2. Helical Scan Recording.



69. What is snapshot?


A snapshot of a data object contains an image of the data at a particular point in time.



70. What is HSM?


Hierarchical storage management - An application that attempts to match the priority of data with the cost of storage.



71. What is Hot-swapping?


Devices are allowed to be removed and inserted into a system without turning off the system.



72. What is Hot-Sparing?


A spare device is available to be inserted into the subsystem operation without having to remove and replace a device.



73. What are different types of backup system?





74. What is the difference between mirroring, routing, and multipathing?



Redundancy Functions Relationships Role



75. Name a few types of tape storage.





76. What is a sequence in FC?


A group of one or more frames that encompasses one or more “information units” of an upper-layer protocol.
Example:
A SCSI task requires
  1. One sequence to transfer the command
  2. One or more sequences to transfer the data
  3. One sequence to transfer the status



77. What is an exchange in FC?


An exchange establishes a relationship between two N_Ports, which then transfer data via one or more sequences within this relationship.
Example: An exchange exists to transfer the command, data, and status of one SCSI task.



78. Why do we need login in FC?


Port Login: To exchange service parameters between two N_Ports
Process Login: To establish the SCSI operating environment between two N_Ports
Fabric Login: Similar to port login, FLOGI is an extended link service command that sets up a session between two participants. With FLOGI, a session is created between an N_Port or NL_Port and the switch.



79. What are the different types of clusters?





80. What are three levels of management in storage?


  1. Storage Level Management
  2. Network Level Management
  3. Enterprise Level Management



81. What are the key activities in SAN management?





82. What is the difference between HBA and NIC?


HBA => Host bus adapters (HBAs) are used for storage traffic, while network interface cards (NICs) are used for IP-based LAN traffic.



83. What is the measuring unit of data activity?


Gigabits per second (Gb/s)



84. What are the basic storage policies?





85. What is bypass circuitry?


A circuit that automatically removes the storage device from the data path (taking the FC device out of the FC-AL loop) when signaling is lost (this signal is called the port bypass signal).



86. How many connections are possible in Fabric topology?


2^24 (each port has a 24-bit address), and the largest possible fabric has 239 interconnected switches.
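
The 24-bit port address is conventionally read as three 8-bit fields: Domain (the switch), Area, and Port. The sketch below splits an address into those fields; the function name split_fc_address and the sample address are illustrative assumptions.

```python
FC_ADDRESS_SPACE = 2 ** 24  # 16,777,216 possible 24-bit addresses


def split_fc_address(fcid: int):
    """Split a 24-bit Fibre Channel address into (domain, area, port)."""
    if not 0 <= fcid < FC_ADDRESS_SPACE:
        raise ValueError("Fibre Channel addresses are 24 bits wide")
    domain = (fcid >> 16) & 0xFF  # identifies the switch
    area = (fcid >> 8) & 0xFF
    port = fcid & 0xFF
    return domain, area, port


# Address 0x010200: domain 1, area 2, port 0.
fields = split_fc_address(0x010200)
```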



87. What is one of the constraints of using a storage switch?


Latency



88. What is the difference between NAS and SAN?



NAS SAN


89. What is Jitter?


Jitter refers to any deviation in timing that a bit stream suffers as it traverses the physical medium and the circuitry on board the end devices. A certain amount of deviation from the original signaling will occur naturally as a serial bit stream propagates over fibre-optic or copper cabling. Jitter is mainly caused by electromagnetic interference.



90. What is BER (Bit error rate)?


BER is the probability that a transmitted bit will be received erroneously. It is measured by counting the number of erroneous bits at the output of the receiver and dividing by the total number of bits transmitted.
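
The measurement amounts to a ratio of erroneous bits to total bits. A minimal sketch, assuming the sent and received data are available as equal-length byte strings; the function name bit_error_rate and the sample data are made up.

```python
def bit_error_rate(sent: bytes, received: bytes) -> float:
    """Erroneous bits at the receiver divided by total bits transmitted."""
    if len(sent) != len(received) or not sent:
        raise ValueError("need two non-empty byte strings of equal length")
    # XOR leaves a 1 in every bit position where the two streams differ.
    error_bits = sum(bin(a ^ b).count("1") for a, b in zip(sent, received))
    return error_bits / (len(sent) * 8)


# One flipped bit out of 16 transmitted bits gives a BER of 0.0625.
ber = bit_error_rate(b"\x00\x00", b"\x01\x00")
```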



91. What is WWPN?


A WWPN (World Wide Port Name) is a 64-bit identifier, usually written as 16 hexadecimal characters, that is assigned to a port. The SAN Volume Controller uses it to uniquely identify the Fibre Channel HBA that is installed in the host system.
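
A WWPN is 64 bits long, which is 16 hexadecimal digits, commonly shown as eight colon-separated pairs. The sketch below validates a WWPN string and normalizes it to that form. The helper name normalize_wwpn and the sample value are hypothetical.

```python
import re


def normalize_wwpn(text: str) -> str:
    """Return a WWPN as eight lowercase, colon-separated hex pairs."""
    digits = re.sub(r"[:\-\s]", "", text).lower()
    if not re.fullmatch(r"[0-9a-f]{16}", digits):
        raise ValueError("a WWPN must contain exactly 16 hexadecimal digits")
    return ":".join(digits[i:i + 2] for i in range(0, 16, 2))


formatted = normalize_wwpn("10000000C9AABB01")
```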



92. What is connection allegiance?


When multiple connections are established, each individual command/response pair must flow over the same connection. This connection allegiance ensures that specific read or write commands are fulfilled without the additional overhead of monitoring multiple connections to see whether a particular request has completed.



93. What is burst Length?


The burst length is the number of bytes that the SCSI initiator sends to the SCSI target in the FCP_DATA sequence.



94. What is NAS in detail?


NAS, or Network Attached Storage
NAS is used to refer to storage elements that connect to a network and provide file access services to computer systems. A NAS Storage Element consists of an interface or engine, which implements the file services, and one or more devices, on which data is stored. NAS elements may be attached to any type of network. When attached to SANs, NAS elements may be considered to be members of the SAS (SAN Attached Storage) class of storage elements.

A class of systems that provide file services to host computers. A host system that uses network attached storage uses a file system device driver to access data using file access protocols such as NFS or CIFS. NAS systems interpret these commands and perform the internal file and device I/O operations necessary to execute them.

Though NAS does speed up bulk transfers, it does not offload the LAN the way a SAN does. Most storage devices cannot simply plug into Gigabit Ethernet and be shared; this requires a specialized file server, and the variety of supported devices is more limited. NAS has various protocols established for needed features such as discovery, access control, and name services.



95. iSCSI fundamentals


iSCSI is a protocol defined by the Internet Engineering Task Force (IETF) which enables SCSI commands to be encapsulated in TCP/IP traffic, thus allowing access to remote storage over low cost IP networks.



96. What advantages would using an iSCSI Storage Area Network (SAN) give to your organization over using Direct Attached Storage (DAS) or a Fibre Channel SAN?





