

Unreproducible Issues

The following issues have been observed in CA AppLogic® releases but are extremely difficult, if not impossible, to reproduce and have been observed only once or twice. If any of these issues appears on your grid, please send a bug report to CA describing which problem occurred and which CA AppLogic® commands were executed before the failure.

  1. Defect SCR 2842 - Server rebooted due to a crash in the Linux kernel (observed in various releases)

    A server in the grid rebooted on its own due to a crash in the Linux kernel in dom0 of the server. Unlike in previous CA AppLogic® releases, this does not cause the entire grid to fail, but it can cause application downtime. In such a case, CA AppLogic® restarts the appliances that were running on the failed server on other servers in the grid. If this issue is observed on your grid, contact CA Support.

  2. Defect SCR 2834 - Server loses connection to the grid controller

    In CA AppLogic® 2.4, there have been several cases where a server loses connection to the grid controller and reboots. This causes all of the appliances that were running on the server to be rescheduled on other servers in the grid and can also cause application downtime. It is unknown why the servers are losing their connections to the grid controller.

    If the server's connection to the grid controller is lost, the server tries to reconnect. If it succeeds, the server remains operational and there is no application downtime. If the server cannot reconnect to the grid controller within one minute, the server is rebooted and application downtime occurs. When a server loses its connection to the grid controller, a message is logged to the dashboard. If this problem is observed, contact CA Support.

  3. SCR 2903 - Volume resize of 4 NTFS volumes executed at the same time failed

    On CA AppLogic®, resizing 4 NTFS volumes at the same time caused all four volume resize operations to fail. This issue has been observed only once.

  4. SCR 3289 - NASR replication failure was observed when almost out of disk space

    While NASR was replicating an 800 MB file on a 1 GB volume, the NASR appliance became unresponsive. CA is unable to reproduce this issue. If this issue is encountered on your grid, contact CA Support.

  5. SCR 3711 - Opening many graphical consoles crashed a server in the grid

    A user opened six graphical consoles at the same time, each to a different Windows appliance running on the grid. When the user opened a seventh graphical console, one of the servers rebooted and rejoined the grid. The appliances that were running on the failed server were restarted on other servers within the grid. This issue has been observed only once.

BFC Known Problems

We have identified the following known problems with the Backbone Fabric Controller (BFC) in this release:

  1. If you are running a BFC database replica on a hard-mounted NFS file system (hard mounts are the NFS default; do not use the optional soft mount functionality) and that file system fails, the BFC hangs. This behavior is a characteristic of NFS itself and not something the BFC has direct control over. If you end up in this state and are unable to restore the NFS file system, you can remove the BFC's dependence on that replica and restore normal operation with the following steps:
    1. Log in to the BFC system as root.
    2. Change to the bfcadmin user by typing the following command:
        su - bfcadmin
      
    3. Run <BFC install location>/bin/stop_replication (by default, /opt/bfc/bin/stop_replication)

    Important! After breaking this dependence, your system runs without a replica, so go back into the UI and establish another replica at the same or a different location.

  2. Defect SCR 6990 - Cannot unset the default VLAN for a grid via the BFC API
  3. Defect SCR 6027 - Starting the grid from the BFC UI fails after the grid is shut down using the “3t grid shutdown” command

    Please do not use the “3t grid shutdown” command on a grid.

  4. Defect SCR 7036 - ESX grid fails due to NFS mount error

    When this occurs, running “service nfs restart” on the BFC should resolve the problem.

  5. Defect SCR 7058 - Failed ESX grid node goes into an infinite loop of reboot once it is restarted
  6. Defect SCR 6424 - BMI install prompts for Driver Disk on HP DL360g4p

    If you get this prompt, press the “Esc” key to continue the install.

  7. Defect SCR 6779 - Servers that are known to have GigE interfaces sometimes report/fail saying that they are not running at GigE speeds

    In CA AppLogic® 3.5, some Broadcom Corporation NetXtreme II NICs misreport as being too slow. If you get this error, you can try rediscovering the server.

  8. Defect SCR 7296 - BFC: Can't create new grid when the checklist shows Replica DB Space Error.

    If the BFC runs out of space before it is able to shut down, you must free up space and then restart the BFC for it to function properly again.

  9. Defect SCR 7312 - Unattended install fails with password !"$%&/()=?'

    If you are performing an unattended install with this version of the product, your password cannot contain the “=” character.

  10. Defect SCR 7376 - STP check is skipped during network detection if server’s public port is configured as trunk

    This bug could occasionally allow servers into grids that should be blocked. If the ports are properly configured, this problem will not be encountered.

  11. Defect SCR 7401 - BFC throws "System_limit" error when total no. of characters in the "Edit Grid Parameters" textbox exceeds 256 characters.

    If you need to use more than 256 characters, simply break those parameters into more than one update of the grid.
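    The workaround above can be sketched as a small helper that groups parameters into batches, each short enough for one update. This is only an illustration: the function name `batch_parameters` is hypothetical, and the newline separator between entries is an assumption about how the entries are joined when counted against the 256-character limit.

```python
def batch_parameters(params, limit=256, sep="\n"):
    """Group parameter strings into batches whose joined length is <= limit.

    Assumption: entries in one update are joined with `sep`, and the
    separator counts toward the limit.
    """
    batches, current, length = [], [], 0
    for p in params:
        if len(p) > limit:
            raise ValueError(f"a single parameter exceeds {limit} characters")
        # Cost of adding p to the current batch, including a separator if needed.
        added = len(p) + (len(sep) if current else 0)
        if current and length + added > limit:
            batches.append(current)          # current batch is full; start a new one
            current, length = [p], len(p)
        else:
            current.append(p)
            length += added
    if current:
        batches.append(current)
    return batches
```

    Each returned batch would then be submitted as a separate grid update.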

  12. Defect SCR 7413 - BFC UI shows incorrect count of CPU cores when Hyper Threading is disabled.

    On some servers, the CPU count reported by the system is the same when Hyper-Threading is disabled as when it is enabled. This has been observed on some Dell R610s.

  13. Defect SCR 7470 - BFC fails to apply grid parameters when more than 1 parameter is passed from the API call.

    This is an issue with how parameters are written to the configuration file passed to aldo set. The same failure occurs in the UI if the user enters data with a comma between entries. The workaround for the BFC API is to pass only a single string, with a newline separator between the entries.

    For example:

    \"additional_config\":[\"ext_dns1=155.35.34.108\next_dns2=141.202.1.108\"]
    

    Instead of:

    \"additional_config\":[\"ext_dns1=155.35.34.108\",\"ext_dns2=141.202.1.108\"]
    
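    A minimal sketch of building the working form of this value programmatically follows. Only the `additional_config` key and the two sample entries come from the example above; the helper name `build_additional_config` is hypothetical, and the rest of the API request (endpoint, other fields) is not shown because it is not described here.

```python
import json


def build_additional_config(entries):
    """Return a one-element list holding all entries joined by newlines.

    Passing each entry as its own list element triggers SCR 7470, so the
    entries are collapsed into a single string instead.
    """
    return ["\n".join(entries)]


payload = {
    "additional_config": build_additional_config(
        ["ext_dns1=155.35.34.108", "ext_dns2=141.202.1.108"]
    )
}

# json.dumps escapes the newline as \n inside the single string.
print(json.dumps(payload))
# {"additional_config": ["ext_dns1=155.35.34.108\next_dns2=141.202.1.108"]}
```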
  14. Defect SCR 7523 – Upgrade of BFC from 3.1 to 3.5 will fail if a grid contains an application IP address range that was selected from a subnet which was subsequently deleted.

    This issue occurs because deleting a subnet in 3.1 does not properly fail when grids have application IP address ranges in that subnet. During the upgrade, the upgrade process looks for the deleted subnet and fails because it is missing. As a workaround, use the instructions from the failed upgrade to restore your previous 3.1 BFC installation. Then, go to each of the grids and remove any application IP address ranges that do not belong to a currently configured subnet. In some cases, such as when you subsequently re-added the same subnet with a new CIDR prefix length, the range may be within the bounds of a current subnet, but the underlying subnet component is incorrect and still causes an upgrade failure. To be sure, validate that the subnet in the BFC matches the parameters of the application IP address range in the grid controller UI.

  15. Defect SCR 7047 - Known issue with isotool -o command.

    The isotool -o parameter does not correctly show the USB devices attached to the machine (CentOS 5.5 box). This is a known issue with CentOS 5.5. To resolve this, you must issue the following shell command as user root:

    service haldaemon restart 
    
  16. Known Issue with Fusion Charts in Internet Explorer 9

    If the graphics rendering option is not set correctly in Internet Explorer 9, graphs in the BFC do not display correctly. Affected graphs appear in the BFC Dashboard, Grids, and Servers pages.

    To fix this problem, in Internet Explorer 9 click Internet Options on the Tools menu. Click the Advanced tab and locate the Accelerated graphics section. Select the Use Software Rendering check box. Save your changes and restart IE.

  17. Defect SCR 7707 - During upgrade bfc service hung at starting application 'components' (22 of 45) state.

    This problem can occur if you have many VLAN/subnet sets and bypass the warning, shown during the upgrade, that this might be a problem. If you get this warning during the upgrade, please contact Support before proceeding.

  18. Defect SCR 7724 - system with 1000 MACs set to AutoDiscovery (blacklist) mode brings BFC down.

    If you need to manage this many MACs, it is best to use Manual Configuration (whitelist) mode with 3.5 instead of AutoDiscovery (blacklist) mode.

  19. Defect SCR 7765 - Inventory fails when no external IP addresses are available.

    Please ensure that you have external IP addresses available when adding servers to a BFC.

  20. Defect SCR 7955 - BMI : Remote Fresh Unattended Installation using PXE Server Failed.

    As part of the process defined for single bare metal ISO creation using the Bare Metal ISO tool, the user needs to place bfcbaremetal.iso and bfcinstall.iso in the directory declared through the ‘rawiso’ parameter in the Bare Metal ISO tool project file. The user then needs to assign ‘644’ permissions to these ISOs using the following commands:

    chmod 644 bfcbaremetal.iso
    
    chmod 644 bfcinstall.iso
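    The same permission change can be scripted, for example as a pre-flight step before starting the PXE install. The following is a sketch only: the function name `set_iso_permissions` is hypothetical, and the directory argument stands for whatever path the ‘rawiso’ parameter points to in your project file.

```python
import os

# ISO file names taken from the step above.
ISO_NAMES = ("bfcbaremetal.iso", "bfcinstall.iso")


def set_iso_permissions(rawiso_dir):
    """Apply mode 644 to each expected ISO; return the names not found."""
    missing = []
    for name in ISO_NAMES:
        path = os.path.join(rawiso_dir, name)
        if os.path.isfile(path):
            os.chmod(path, 0o644)   # equivalent to: chmod 644 <file>
        else:
            missing.append(name)
    return missing
```

    A non-empty return value indicates that one of the ISOs has not yet been copied into the directory.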
    
  21. Defect SCR 7984 - Issues with updating app IP and controller IP after hitting Reset button

    Due to this bug, you cannot swap the controller and application IPs with each other in one step. If you must swap them, first set them to some other values, and then set them again to the intended values.

  22. Defect SCR 8004 - After BFC upgrade from 3.1GA, Administration—Networks—External Tab shows incorrect list of available IPs.

    The IP ranges get mapped to the correct VLANs during an upgrade from pre-3.5.1 to 3.5.1 or later except for one case. If you create a grid and later add a network with a VLAN to that grid, the IPs reserved for that VLAN will remain reserved on the default (untagged) network after the upgrade. If you need to recover those specific IPs, please contact Support.

  23. Defect SCR 8005 - API: Issues while adding VLAN to the existing untagged grid via API.

    If you attempt to add a tagged network to an untagged grid via the API, the call will succeed, instead of returning 400 Bad Request.

  24. Missing localization strings in BFC 3.5.2.

    Some of the bug fixes and modifications in BFC 3.5.2 changed strings used in the BFC. The modified strings are few in number and mostly in the area of VLAN management, but they appear in English regardless of your language of choice.