Previous Topic: Known Issues and Important NotesNext Topic: Unreproducible Issues


Known Issues and Limitations

This section describes the known issues and limitations at this time.

CA AppLogic Limitations
  1. New for 3.7: The included HalSign TurboGate PV Drivers v3.0.1 does not install on Windows 8 32-bit. It does install correctly on Windows 8 64-bit.
  2. New for 3.7: Sample applications from older releases will fail to start on a 3.7 grid due to new resource requirements for appliances in the system and dynamic catalogs (due to these appliances being re-based on CentOS 6.3). Please use the updated sample applications provided with the 3.7 release (the data volumes from the older sample applications can be migrated over to an instance of the newer sample applications).
  3. New for 3.7: In order to use Windows-based appliances on ESX-based grids, the maintainer must import the system_ms-ESX-1.1.9.tar catalog into the grid. Note that the default system_ms-1.1.10.tar catalog is for Xen only and will not work on ESX. This is due to the new Windows read-only volume support that was added only for Xen-based grids and is not supported on ESX-based grids.
  4. New for 3.7: For ESX-based grids, Windows 8 and 2012 Server are not supported. This is due to a limitation with ESX 4.x which is used by CA AppLogic. However, Windows 7 is supported for ESX-based grids. For Xen-based grids, all Windows flavors are supported (Windows 2003/2008/2012 Server, Windows 7 and 8).
  5. New for 3.7: INSSLR2 does not support secondary IP addresses (it never has in any previous release). This is an enhancement that may be included in a future release.
  6. Grid size is limited to 128 servers per grid for Xen, and 80 servers for ESX
    This is a limitation of the current CA AppLogic release. This release has been certified up to 30 servers; however configurations up to 128 (for Xen) and 80 (for ESX) servers are supported.
  7. CA AppLogic currently cannot be installed on servers with 3 TB or larger drives.
  8. For proper operation of the user interface, enable JavaScript, pop-ups, and cookies for the grid controller host. Ensure your browser is updated with the latest available version to correct defects that affect AJAX applications.
  9. Protocols are not enforced on appliance terminals, only endpoints are enforced.

    This means that an appliance can only talk to appliances connected to it (plus its own server and the grid controller). Nevertheless, protocols on new appliances should be properly specified to help ensure application design integrity and compatibility with future versions of CA AppLogic.

  10. The total available disk space does not take volume mirroring into account

    The total available disk space reported by the grid info command is a raw estimate and does not take volume mirroring into account. The true available disk space is the reported available amount divided by the number of mirrors (2 mirrors by default). For example, if there is 1000GB of available disk space and the grid was configured for mirroring of 2, the available disk space is 500GB. Also, to successfully mirror volumes, there must be enough disk space on at least X servers where X is the number of mirrors (CA AppLogic will not fail to create a volume if any one of its mirrors cannot be created, it will display a warning that the volume could not be mirrored).

  11. A server failure during application start may cause the application start to fail

    If an application is started and one of the grid's servers fails, the application start will fail if one or more of the application's appliances were scheduled to run on the failed server. If this situation occurs, simply restart the application.

  12. The volume management GUI available using the filers has an upload limitation of 10MB per file

    To upload larger files to your volume, use the vol manage shell command; don't forget to specify the external IP settings for this command to enable remote access from within the volume manager. For more information, see the reference for the vol manage command.

  13. The property markup for appliance configuration is only supported for the volfix configuration mode

    The new dhcp configuration mode does not support the property markup for appliance configuration. When porting appliances from volfix to dhcp configuration modes, the APK documentation describes how to deal with appliances that depend upon the property markup for appliance configuration. See the Appliance Kit (APK) for more information.

  14. Validation flags don't appear if the application is opened in read-only mode

    To see the validation flags for an application, open the application in edit mode. The validation flags are used to flag appliances that do not have all of their mandatory properties/terminals/volumes properly configured.

  15. All of the appliances that are distributed with CA AppLogic are missing their GUI/desktop packages/support (X11, Gnome Desktop, etc.)

    Therefore, the graphical console cannot be used with these appliances. This is done on purpose to make the appliances as compact as possible. Using the new iso2class utility, users may create their own appliances with full desktop support.

  16. Running more than 1 Windows application with appliances having the same instance name results in a duplicate computer name error from Windows

    This error is due to the fact that CA AppLogic sets the computer name of an appliance to its instance name. Therefore, if you have more than 1 appliance running on a grid that all have the same instance names, the duplicate name error will be displayed in Windows on the graphical console. This error is simply a warning and does not affect the grid or its operation. However, if you need to use Windows as a domain controller, you will need to set the computer names to unique names for each appliance. You may use the wincfg utility to set the computer name in your appliance.

  17. The graphical console requires the latest version of Java in your IE/FF browser

    If the latest version of Java is not used, the graphical console may not work correctly (it will hang while trying to load). Before reporting graphical console errors to CA, be sure to verify that you are using the latest Java version (if you need to upgrade java in your browser, be sure to re-open your browser afterwards for the graphical console to work correctly).

  18. Failover groups may not be satisfied upon controller recovery

    When a secondary server takes over as the new primary server, if there are not enough resources available on the server to start the grid controller, CA AppLogic restarts appliances which are running on the new primary server on other servers within the grid so the grid controller can be started on the new primary server. Note that this may break appliance failover groups. If CA AppLogic stops one of these appliances it may not be able to restart the appliance on another server because there may not be enough resources to satisfy the failover group.

  19. Xen-HVM-based appliances use more memory than their configured amount

    All HVM-based appliances (Windows, etc.) use more memory on the server than what they are configured to use. Typically, depending upon the amount of memory assigned to an HVM-based appliance, the appliance uses additional memory on the server in which it is running (this additional memory is required by the virtualization hypervisor running on the servers and is known as shadow memory). Therefore it is possible that even though a server might have enough available memory as compared to what is assigned for the appliance, the appliance will not be able to run on that server due to the additional shadow memory needed for HVM-based appliances that is not available on the server. The CA AppLogic scheduler does take this extra shadow memory into account when scheduling appliances during application start.

  20. Internet Explorer cannot be used to access the graphical console of an appliance that is based on Ubuntu

    Any other browser may be used instead.

  21. The shared interface support in CA AppLogic 3.x will not work on Windows-based appliances

    Shared interfaces should work with all other operating systems.

  22. If any server of a grid has greater than 4 TB of local storage, the dom0 memory for the entire grid must be set to 1 GB. This is specified through the BFC using the parameter dom0_vm_mb=1024 when you install the grid.
Known Problems and Issues

The following are the known issues in this release:

Severity 1
Severity 2
Severity 3
  1. Defect SCR 8243 - vol resize/app provision/GUI network errors while the grid and the grid controller are under heavy load

    While the grid and the grid controller itself is under heavy load, it is possible for various grid controller commands (app provision/vol resize for example) to fail and network errors in the GUI to occur. If this issue is encountered, increase the grid controller CPU to 1 and the memory to at least 2GB and this should workaround the issue.

  2. Defect SCR 8258 - On ESX-based grids that are under heavy load, occasionally vol resize operations will fail or hang. If this issue is encountered, re-execute the volume operation.

    This issue will be fixed in a subsequent release.

  3. Defect SCR 8178 - Pinned appliance fails to start on other available servers after max VM/server limit is reached (on the server to which the appliance is pinned).

    In order to work around this issue, unpin the appliance and restart the application. This issue will be fixed in a subsequent release.

  4. Defect SCR 8158 - Vol Manage: Unable to manage two ext3-snapshot volumes

    Currently the filer does not support managing two ext3-snapshot volumes at the same time. This issue will be fixed in a future release.

  5. Reduced Performance When Using HP Smart Array RAID Controller Without Write Cache

    When using the HP Smart Array RAID controller without the write cache enabled, there is a 50% reduction in performance. This issue has been verified on a HP DL 580 G7 Server, with Smart Array P410i 256mb. These cards require a battery or capacitor to be installed to enable the write cache.

  6. Ensure the SR-IOV BIOS setting on Emulex 10G NICs is DISABLED

    When using ServerEngines Corp. Emulex OneConnect 10Gb NIC (be3) (rev 01) NICs with CA AppLogic, these NICs incorrectly bounce packets if the SR-IOV BIOS option is enabled. These bounced packets alter the bridge's forwarding cache, causing the bridge to drop packets instead of forwarding them to the correct destination. This causes instability in CA AppLogic which results in intermittent application start failures. Therefore, please ensure that the SR-IOV BIOS setting is DISABLED for all Emulex 10G NICs on all servers within the grid.

  7. Defect SCR 2203 - Stuck volume mount causes failures to start applications (rare, cannot reproduce)

    Very rarely an application will fail to start due to a stuck volume mount on one of the servers. CA AppLogic detects stuck volume mounts and reports them to the user on the grid's dashboard. If this problem occurs on your grid, notify CA Support. Optionally, disabling the server or rebooting the server that has the stuck mounts will resolve this issue.

  8. Defect SCR 3416 - Grid controller does not automatically recover if the primary server becomes unresponsive

    If this situation occurs, rebooting the primary server will restore the grid to an operational state. Note that this issue has not been observed in CA AppLogic 3.5 or 3.7.

  9. Defect SCR 2239 - CAT: VDS: security vulnerability: initial user/password setup
  10. Defect SCR 1471 - GUI times out and logs out the user while there is load on the grid controller

    The GUI no longer automatically logs the user out when there is heavy load on the grid controller. Instead, the user will receive a message stating that there was a network error. In this case however, the GUI is still fully functional. The network error message will only be received when there is heavy load on the controller, such as starting 4 applications at the same time AND copying a large multi-GB volume. In large grids, try assigning up to a full CPU core and 1GB RAM to the controller.

  11. Defect SCR 857 - Grid reboot may degrade one or more system volumes

    If a grid is rebooted using the grid reboot command, when the grid comes back up after the reboot, one or more of the system volumes may become degraded. CA AppLogic automatically repairs these volumes as highest priority.

  12. Defect SCR 1199 - Unable to migrate a volume whose streams are all on disabled servers

    When migrating a volume, verify that at least one of its streams is on an enabled server or else the migration command will fail. The volume can be completely migrated off of its original set of servers by migrating the volume twice.

  13. Defect SCR 1496 - Grid automatic application recovery (HA) may fail due to servers taking too long to reboot

    Some physical servers may take a long time to reboot - this may cause CA AppLogic's automated grid recovery to fail. The end result of this is that applications may not be all restarted automatically after the grid recovers from a failure. This is due to the grid controller waiting for a maximum of 10 minutes for all servers to reboot and reconnect to the grid controller (which may not be enough time for all servers to reboot). Workaround is to manually restart applications after all servers have reconnected to the grid controller - execute "list srv" to help ensure that all servers are connected to the grid controller - they all should be in the UP state. In CA AppLogic 2.1, with server boot timeout of 10 minutes, this may occur primarily if a server fails to boot due to hardware or BIOS malfunction.

  14. Defect SCR 1234 - Grid flapping file is not always reset when the operator intentionally reboots the grid

    When the operator reboots the grid, the grid flapping state is supposed to be reset and a message should be displayed on the dashboard stating that the operator rebooted the grid intentionally ("Grid has been restarted by operator on ..."). Occasionally when rebooting the grid, the grid file is not reset nor is the dashboard message displayed. The only problem that this may cause is upon the next grid failure, the applications may not be automatically restarted (depending on how many times the grid has failed when this bug occurs). To workaround this problem, if after an intentional grid reboot there is no dashboard message displayed, contact CA Support to have the grid flapping state reset on your grid.

  15. Defect SCR 1360 - Appliance shows slightly less memory and less disk size than allocated

    The reason for the slightly reduced resources is related to allocation for service areas. For memory, it is likely due to Xen related to the memory map table for a virtual machine. For disk, it is due to normal file system service areas (this is the same as on regular Linux servers).

  16. Defect SCR 2293 - Occasionally opening an application in the editor results in a message that the application is locked for editing

    In this case, the application is not opened for editing by any other user but the CA AppLogic editor erroneously thinks somebody else has the application open for editing. If this occurs, simply override the application lock when prompted by the editor upon opening the application.

  17. Defect SCR 2313 - IE is about 2x slower than FireFox/Chrome/Safari when using the CA AppLogic GUI

    The main slowdown occurs when opening an application in the CA AppLogic infrastructure editor.

  18. Defect SCR 2497 - It takes 15 minutes to re-open graphical console after client computer crashed while graphical console was open

    If the client has the graphical console open and they lose connection to the internet (client network card failure, client computer crash, internet access is unavailable, etc.), it will take 15 minutes to re-open the graphical console.

  19. Defect SCR 2548/ SCR 2549 - Issues when using the CA AppLogic graphical console with Ubuntu

    The mouse is hard to use in Ubuntu when using the CA AppLogic graphical console. This is due to a limitation of the Xen VNC support (mouse acceleration is not supported). Some users report that adjusting the mouse settings in Ubuntu resolves the issue. Also, rarely keystrokes will be repeated several times when typing in text from the keyboard (in such cases, simply delete the extra characters that are displayed).

  20. Defect SCR 2498 - All text entered by the user in the text boot console is echoed to the console

    This includes passwords when logging into an appliance. The text boot console should only be used for debugging purposes. The SSH console can be used instead for all other purposes.

  21. Defect SCR 2501 - User must press enter to see output in the text boot console after it is opened for the second time

    If a user re-opens the text boot console for an appliance after it has already been opened, they must press the enter key to see either the login prompt or the command prompt. This is because the boot console is waiting for user input (either for login information or a command to be executed).

  22. Defect SCR 3107 - Appliances in failover groups are not accounted for when restarting the grid controller on a secondary server

    If a grid has an appliance that is part of a failover group running on a secondary server where the grid controller needs to be restarted, CA AppLogic may stop that appliance which could break the failover group.

  23. Defect SCR 2134 - Grid upgrade causes an incorrect warning about the cause of the grid reboot

    After upgrading a grid to the latest release, a dashboard message is posted stating that the grid failed due to a hardware issue. This message can be safely ignored and removed from the dashboard.

  24. Defect SCR 3709 - Appliances become temporarily inaccessible (5 min) if external NIC fails in network HA configuration

    If using a network HA configuration with CA AppLogic and there is an external network failure, applications/appliances that use external interfaces may become inaccessible for up to 5 minutes. This appears to be caused by the external router caching MAC addresses. Waiting for the router to flush its ARP cache or sending an ARP response with arping from the application restores operation. This only affects the external network (the backbone network is not affected).

  25. Defect SCR 4159 - Recovery GUI does not work on ESX-based servers

    The recovery GUI only works on Xen-based servers.

  26. Defect SCR 4273 - MON counters for shared interfaces do not work properly

    Shared interfaces do not support appliance counters.

  27. Defect SCR 5242 - System Up time is not reset after the Grid power cycle is done

    If a user power-cycles a grid, the system uptime is not reset. If the grid is rebooted, the system uptime should be reset.

  28. Defect SCR 5269 - The grid power_cycle command may fail to reboot the primary server

    If a user power-cycles a grid using the grid power_cycle command, the primary server may fail to reboot. This only occurs when the command is executed after a new grid install and the grid was never rebooted before the power cycle command was executed. Rebooting the grid at some point after a new grid install will avoid this issue.

  29. Defect SCR 6378 - SAN-NFS: Deletion of running/failed grid leaves behind the grid's folder (but contents of the folder are deleted)

    When a grid that used a SAN is destroyed, CA AppLogic deletes the contents of the grid’s folder on the SAN, but leaves behind the empty folder. This issue will be resolved in a future release.

  30. Defect SCR 6701 - Upgrade from 3.0 or 3.1 to 3.5 rarely fails

    Very rarely, an upgrade to 3.7 from either 3.0 or 3.1 may fail. In this particular upgrade failure case, the following messages are present in the grid’s status log accessed using the BFC (click on the status of the grid to open the log).

    installing the controller image
    ioctl: LOOP_SET_FD: Device or resource busy
    installing new controller FAILED, aborting 
    

    If these messages are present in the log, rerun the upgrade again and it should succeed.

    Note: This issue is actually a bug in both CA AppLogic 3.0 and 3.1, and is resolved in CA AppLogic 3.7.

  31. Defect SCR 7048 - ESX grid rollback from 3.5 to 3.1 does not work

    The rollback command does not work from 3.5 to 3.1 for an ESX-based grid. However, as a workaround, the downgrade command can be used (note that downgrade takes a bit longer than rollback). This issue will be resolved in a future release.

  32. Defect SCR 7064 - ext3-snapshot based volumes do not work on ESX-based grids

    Ext3-snapshot based volumes do not work on ESX-based grids. However these volumes work on Xen-based grids. If you are using an ESX-based grid and you need to use an ext3-snapshot volume, you can add a Xen-based node to your grid and use that node to create/manage your ext3-snapshot volumes (when running the volume commands, disable all of the ESX servers so the CA AppLogic filer will run on the Xen-based node). This issue will be resolved in a future release.

  33. Defect SCR 7397 - “vol migrate” failed to migrate one of the volume streams from the local SAN (incorrectly trying to migrate stream to external SAN)

    An attempt to migrate a volume stream on the local SAN might fail on grids that are configured to use an external SAN. Iinstead of migrating the volume stream to the local SAN, CA AppLogic incorrectly tries to migrate the stream to the external SAN. If you encounter this failure, use the store=local option with the vol migrate command. This issue will be resolved in a future release.

  34. Defect SCR 7394: Grid controller hang after upgrading CA AppLogic from 3.0.30 to 3.5.x

    When CA AppLogic is upgraded from 3.0.30 to 3.5.x, the grid controller intermittently hangs and any 3tshell command executed returns a low memory condition error message.

    To work around the issue, reboot the grid controller. This issue will be resolved in a future release.

    Note: This could affect the 3.7 release as well.

  35. Defect SCR 8845: Resize of an NTFS volume may take a long time and appear to be stuck

    While resizing very large NTFS-based volumes (many GBs in size), the resize operation may stop reporting progress and will appear to be stuck. However the resize operation is indeed progressing and will be completed successfully. This reporting issue will be fixed in a future release.

  36. Defect SCR 8751: Performance issues with megaraid_sas driver in Applogic 3.1+ .

    Since CA AppLogic 3.1+, the megaraid SAS driver’s performance is degraded and operates ~75% slower as compared to a physical server. CA is currently working on resolving this issue and will release a hotfix as soon as the issue is identified and fixed. Until this issue is resolved, it is strongly recommended to use a different type of disk controller.

  37. Defect SCR 8908: Upgrading to the latest Windows APK for a Windows 2008 Enterprise appliance will fail.

    There is an issue with upgrading Cygwin while trying to upgrade to the latest Windows APK that is distributed with CA AppLogic 3.7. It is recommended to build a new Windows appliance rather than upgrading until this issue is fixed. This issue will be fixed in a future release.

  38. Defect SCR 8468: 3tshell does not allow spaces in property values when invoked over ssh .

    When executing 3t commands over ssh, the parameters are being split either on a space or on a back tick (`), depending on the way the command is invoked. If a 3t command has a property value with a space in it, the characters after the space will incorrectly be treated as a separate argument. This issue will be fixed in a future release.

  39. Defect SCR 8714: The ORACLE appliance ignores the http_port property.

    The http_port property is ignored; as such the port will always be 8080. This issue will be fixed in a future release.

  40. Defect SCR 8888: For Xen-based grids, there is a limit of 90 HVM-based appliances.

    Attempts to start more than 90 HVM-based appliances on Xen-based grids may fail with mount or appliance start errors. This is a known issue and will be fixed in a future release.

  41. Defect SCR 8914: The graphical console does not work on Mac Safari 6.0.4 with JAVA 7 Update21 (64bit).

    Please use an older version of Safari or refer to the following link for a possible workaround for this issue.

Known Issues Specific to Windows-based Appliances

The following are the key known problems with Windows appliances in this release. Also, see the Windows Appliance Installation Reference for additional procedures and notes.

  1. Defect SCR 8051 - Unable to install the Halsign Turbogate Drivers on Microsoft Windows 8 (32-bit) appliance.

    32-bit Windows 8 is currently not supported by the Halsign Turbogate Drivers; however the 64-bit version of Windows 8 is supported. This issue will be fixed in a subsequent release.

  2. Defect SCR 7899 - No Error is observed when starting multiple Windows-based applications with the same assigned public IP addresses. Also none of the applications are accessible with the configured public IP address.

    The Windows APK currently does not correctly detect duplicate IP address assignments. Therefore it is up to the user to determine if they have accidentally assigned duplicate IP addresses. This issue will be fixed in a subsequent release.

  3. Defect SCR 2751 - Windows filer volume resize can fail on a volume with a corrupt file-system.

    The Windows filer can fail a volume resize operation if the source volume contains a corrupt directory entry/file. The main source of this problem comes from the fact that some of the Microsoft software installations purposely contain invalid directory entries (we are not sure why this is; this has been observed when a user installed a version of Microsoft SQL Server in their appliance). Additionally, the source volume can be corrupt due to normal wear and tear. This issue can be worked around by running a file system repair on the volume (vol fsrepair) before resizing the volume.

  4. Defect SCR 3078 - Resizing an NTFS volume failed due to a Windows filer start failure.

    It has been observed by CA that the NTFS volume resize operation fails about two times out of 100. These two failures occurred because the Windows filer failed to start correctly on the grid. If this issue is observed, repeating the resize operation a second time should succeed. This issue however should be resolved in this release; if this issue is observed, notify CA technical support.

  5. Defect SCR 2750 - Windows filer failed to create an ntfs volume (rare diskpart error).

    The Windows filer uses a Microsoft utility named diskpart to deal with the Windows NTFS volumes. Occasionally diskpart fails to obtain volume information or may fail to mount the volume. This is a very rare failure and may cause either vol create or vol resize to fail over NTFS volumes.

  6. Defect SCR 2748 - Windows appliances occasionally detect duplicate IPs on their internal network.

    If the user has an application that contains a Windows appliance and one or more Windows appliances are added to the app or terminals are added or removed from the Windows appliances, during the first app start some of the Windows appliances may detect duplicate IPs on their internal network (this can only happen during the first app start after the application is modified). This should not cause any operational failure of the application or require user intervention; the duplicate IP addresses are purely temporary. Worse case, some of the network communication involving any of the Windows appliances may be delayed for up to 30-60 seconds.

  7. Defect SCR 2504 - Occasionally disk read/write counter values are reported as zero (Windows perfmon API bug).

    Occasionally zeros are reported for the following disk I/O counters for Windows appliances (even though sustained I/O is being generated): Total bytes written/read, # of volume writes/reads, time spent in writes/reads. This is due to a bug in the Windows perfmon API - the zero values is what is being reported by the Windows perfmon API.

  8. Defect SCR 2821 - The Windows filer MSI do not work under localized Japanese Windows.

    Other than the filer MSI, localized Japanese Windows should work under CA AppLogic.

  9. Defect SCR 2862 - Windows appliance fails to start if a virtual DVD-ROM device is installed.

    A windows appliance fails to start if the MagicISO virtual DVD-ROM device is installed. Virtual DVD-ROM devices are not currently supported in CA AppLogic for windows-based appliances.

  10. Defect SCR 2499 - It can take several minutes to discover new NICs in Windows appliances which can cause boot timeouts.

    Occasionally it takes several minutes for Windows to detect new NICs inside of an appliance. This occurs when the user adds/removes terminals for a Windows appliance singleton. The extra time it takes to detect these new NICs may cause appliance boot timeouts. To workaround this, increase the boot timeout of your Windows appliance.

  11. Defect SCR 2505 - Migration of a windows appliance to another grid may trigger re-activation of the Windows appliance.

    If a user has a Windows appliance on their grid and they migrate the appliance to another grid that has different hardware, the Windows appliance may require re-activation (Microsoft's Windows re-activation). The re-activation is triggered when a specific amount of hardware has changed (it is unknown to CA exactly what hardware changes trigger the re-activation). Note that re-activation may require access to the internet from within the Windows appliance. This particular problem was observed after resizing the Windows appliance boot volume and migrating the appliance to a different grid.

  12. Defect SCR 3814 - Windows 2008 filer root access permissions are limited using ssh.

    This issue only affects Windows 2008 Server 32/64-bit (Windows 2003 server works OK). When accessing a Windows 2008 volume either through the filer using ssh to an appliance, the user may not be able to access/modify files due to permission issues. To access and modify files using the command shell, log in through the graphical console to the Windows desktop and open up a command shell. The command shell can be used to access and modify files.

  13. Defect SCR 8907: Unable to start WIndows 2008 DataCenter appliance with more than eight CPUs (Xen-based grids only).

    Currently Windows 2008 DataCenter edition appliances fail to start if configured to use more than eight CPUs (only on Xen-based grids).